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ABSTRACT 


Algebraic geometry is the study of systems of polynomial equations in one or 
more variables. Thinking of polynomials as functions reveals a close connection between 
affine varieties, which are geometric structures, and ideals, which are algebraic objects. 
An affine variety is a collection of n- tuples that represents the solutions to a system of 
equations. An ideal is a special subset of a ring and is what provides the tools to prove 
geometric theorems algebraically. In this thesis, we establish that a variety depends on 
the ideal generated by its defining equations. The ability to change the basis of an ideal 
without changing the variety is a powerful tool in determining a variety. In general, 
the quotient and remainder on division of polynomials in more than one variable are not 
unique. One property of a Groebner basis is that it yields a unique remainder on division. 

To prove geometric theorems algebraically, we first express the hypotheses and 
conclusions as polynomials. Then, with the aid of a computer, apply the Groebner Basis 
Algorithm to determine if the conclusion polynomial(s) vanish on the same variety as the 


hypotheses. 
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Chapter 1 


Introduction 


Algebra and geometry are often viewed as two distinct branches of mathemat- 
ics. When we find connections between these and other branches of mathematics, our 
understanding of mathematics is enriched. Algebraic geometry is the study of solutions 
of systems of polynomials in one or more variables. The solution set is a collection of 
ordered n-tuples, called an affine variety. Affine varieties are curves, surfaces and higher 
dimensional objects defined by systems of polynomial equations. “The ability to regard 
a polynomial as a function is what makes it possible to link algebra and geometry.” 
[CLO15]. 

The goal of this thesis is to demonstrate how the hypotheses and conclusions of 
theorems that describe characteristics of certain geometric structures can be translated 
to a system of polynomial equations and how algebraic methods such as the Groebner 
Basis Algorithm can be applied to determine if the theorem can be confirmed or rejected. 
This will involve the study of polynomial rings over a field k and ideals. In particular we 
will determine if a polynomial is an element of a particular ideal. Given a polynomial f € 
k[x1,22,...,%n] and an ideal generated by fi, fo,..., fs; denoted by I = (fi, fo,..., fs), 
we will explore how we can determine if f € J. This is related to the geometric problem 
of determining whether a variety V(f) lies on the variety V(f1, fo,..., fs). We will show 
that solving a system of polynomial equations, that is, finding all common solutions in 


k” of a system of polynomial equations 


Fin Coy ag) = fi Coe Oy a eS toe ae) = 


is equivalent to finding the n-tuples in the affine variety V(/fi, fo,..., fs). 


The thesis begins with definitions and fundamental results of the relevant al- 


gebraic and geometric objects. Ideals give us the structure for computing with affine 
varieties. We will study the correspondence between ideals and varieties. In particular, 
we will explore the mappings from ideals to varieties and varieties to ideals and determine 
which mappings represent bijections. 

When working with multivariable polynomials, we encounter difficulties that 
are not present when working with single variable polynomials. For example, different 
orderings on monomials can yield different quotients and remainders when using a division 
algorithm on polynomials. Deriving a convenient basis for an ideal is an important step 
in proving geometric theorems algebraically. We will define and study the properties of 
Groebner bases and demonstrate how strategic use of technology allows us to efficiently 
verify theorems in Euclidean geometry. Several computer programs such as Mathematica 
and Sage have applications that quickly perform algorithms including polynomial division 
and computing a Groebner basis. 

The thesis culminates with proving three theorems involving triangles in the 
Euclidean plane using technology to implement the Groebner Basis Algorithm. The first 
example we will demonstrate is the proof that the three medians of a triangle meet at a 
single point. This point is called the centroid of the triangle. Next, we demonstrate the 
proof that the three altitudes of a triangle meet at a single point, called the orthocenter 
of the triangle. The final example demonstrates the proof that the centroid, orthocenter 


and circumcenter of a triangle are collinear. This line is called the Euler Line. 


Chapter 2 


Algebra and Geometry 


Preliminaries 


2.1 Polynomials and Affine Space 


Using the Groebner Basis Algorithm involves solving a system of polynomial 
equations with any degree and with any number of variables. The examples presented in 
this thesis will contain more than two or three variables. The coefficients will be from 
a field k. In this section, we define the polynomials to be studied. Since each term of a 


polynomial is a monomial, we begin by defining a monomial. 


Definition 2.1. [CLO15] A monomial in 21, 2, ...,%p is a product of the form 


where all of the exponents a1,...,@,, are nonnegative integers. The total degree of the 


monomial is the sum a, +---+a,, and is denoted |a}. 


To make the definition of a polynomial in k[x1, x2,...,%n] less cumbersome, we 
simplify the notation for the monomials as follows. Let a = (a1, @2,...,@n) be an n-tuple 
of positive integers. Then we will express xf - x5?--- a9" as x%. The total degree of the 


monomial, as indicated above, is |a] = aj +-:-+a,. When a polynomial has fewer than 


five variables, we will replace the x1, #2, 23,24 with x,y, z, w. 


Definition 2.2. [CLO15] A polynomial f in x1, 22,...,%, with coefficients in k is a finite 


linear combination (with coefficients in k) of monomials. 


We will express a polynomial f in the form 
PS Gat Baek 
a 


where the sum is over a finite number of n-tuples a = (a1,Q2,...,Q@n,). The set of all 
polynomials in x1, £2,..., 2%, with coefficients in k is denoted k[21, x2, ..., %,]. For example, 
a familiar polynomial ring used in much of mathematics at the secondary level is R{:], 
the ring of polynomials over the field of real numbers. 

The basic terminology we will use when working with polynomials is given in 


the following definition. 
Definition 2.3. [CLO15] Let f = >> aqx® be a polynomial in k[x1,22,...,2n]. 
a 
1. We call ag the coefficient of the monomial xo, 
2. If ag £0, then we call aga a term of f. 


3. The total degree of f, denoted deg(f), is the maximum |a| such that the coefficient 


Qo iS nonzero. 


As an example, the polynomial f = 3a°y?z + 223 — 7x* + 9a?y* has four terms. 
The coefficients are 3, 2, —7, and 9. Notice that f € R[x,y,z]. The total degree of f 
is 6. There are two terms of f, 32y?z and 927y*, that have a total degree of 6. This 
presents a dilemma with ordering the terms of multivariable polynomials that is not an 
issue with polynomials in one variable. Ordering the terms of multivariable polynomials 
will be discussed further in Chapter 3. 

Although the coefficients of the polynomial are from a field k the collection of 
polynomials k[x1,x2,...,2%n] is not a field, but is a ring called the polynomial ring over 


the field k. We examine the difference in the definitions below. 


Definition 2.4. [CLO15] A commutative ring with unity consists of a set k and two 
binary operations denoted “+” and “-” defined on & such that for all a, 6, c € k the 


following conditions are satisfied: 
(i) (closure) a+bekanda-bek. 


(ii) (commutative) a+b=b+aanda-b=b-a. It is the commutativity of the second 


binary operation “-” that allows us to identify the ring as a commutative ring. 


(iii) (associative) (a + b) +c=a+t+(b+c) and (a-b)-c=a-(b-c). 


(iv) (distributive) a-(b+c)=a-b+a-c. 


(v) (identities) There exist members 0,1 € & such that a+0 =a and1-a=a. The 
1 is referred to as the unit. It is with the inclusion of this element we have a ring 


with unity. 
(vi) (additive inverses) Given a € k, there is b € k such that a+ b=0. 


Definition 2.5. [Gall7] A field is a commutative ring with unity for which every nonzero 


element a € k, there is b € k such that a-b= 1. 


Consider two arbitrary polynomials f and g from k[x1,22,...,2%,]. Then f +g 
and f-g € ka, x2,...,2,]. Furthermore, it can be shown that the commutative, associa- 
tive, distributive, identity and additive inverse conditions are satisfied. However, it is not 
possible to find a multiplicative inverse for every nonzero polynomial in k[a1,22,..., pn]. 
For example, let f = x—y. Now f € R{z,y]. Although we have g = = such that 
f-g=1, 9 ¢ R[z, y] since g does not satisfy the definition of a polynomial (see Definition 


2.2). In fact, the only polynomials that have multiplicative inverses in k[x,,22,...,2p| 
are the constant polynomials. We conclude that k[x1,22,...,2n] is a commutative ring 
with unity, not a field. We will refer to k[a1, 72,...,2,] as a polynomial ring. 


The next topic to consider is affine space. 


Definition 2.6. [CLO15] Given a field & and a positive integer n, we define the n- 


dimensional affine space over k to be the set of n-tuples with elements in a field 
k” = {(a1,.--,@n) | a1,---,0n € ky}. 


When the field is R we recognize the affine spaces R? and R® from secondary 
mathematics courses in algebra, geometry and calculus. 
Now we explore how polynomials relate to affine space. An important connection 
is that a polynomial f = So agx® € k[x1,2,...,2n] gives a function that maps the n- 
a 


tuple, an element of k” to an element of k. More specifically, the function 
fik™ ok 


is defined as follows: given (a1,a@2,...,@n) € k”, replace every x; with a; in the ex- 
pression for f. Since all of the coefficients, ag € k, this operation yields an element 


FG Gas 6748p) Ek. 


Example 1. A task in R? presented in a secondary algebra class, for example, might 
require one to find solutions to y = x7. Students would make a list of the ordered 
pairs such as (0,0), (1,1), (2,4) that satisfy the equation. They may also represent the 
equation geometrically by sketching the parabolic curve that represents all solutions to 
the equation. We can generalize this task by expressing the equation as f(x,y) = y— 2”, 
a function that maps ordered pairs to real numbers. For example, f (3,17) = 17—3? = 8. 
The ordered pair (3,17) is not a point on the parabola defined by y = 2?. The only 
ordered pairs that will be on the curve are those that are mapped to zero. Since f (2,4) = 
4—2? = 0, the point (2,4) will be on the curve. The secondary school task is a special 


case of this mapping in which we map n-tuples to the zero element of the field. 


When we think of a polynomial as a function, the connection between algebra 


and geometry becomes more detectable. 


2.2 Affine Varieties 


The study of algebraic geometry has at its core the solution set of a system 
of polynomials. These solution sets are known as affine varieties. The definition and 


examples are the topic of this section. 


Definition 2.7. [CLO15] Let & be a field, and let f;, fo,..., f; be polynomials in 


k|a1,22,...,%p]. Then we set 
Vi fifo fey Hie tn ER | J aeoss..4 Oy) =O forall bw st. 
We call V(f1, fo,..., fs) the affine variety defined by f:, fo,..., fs. 


In other words, an affine variety is a collection of n-tuples representing the 
solutions to a system of equations. We are specifically interested in the n-tuples that 
Sabisty Fri, eo, ese) — 0s fol history) = Onsite Poe ay,) = Oe Wher we 
have a solution to f(x1,22,...,%n) = 0 we say f vanishes at (x11, 22,...,n). 

We will continue to examine the task posed in Example 1 in the context of a 
variety so that we may extend the understanding of the relationship between the algebraic 
representation (the system of polynomials) and the geometric representation (the ordered 
pairs in the Cartesian Plane). 

Consider V(y — x”). We want to find the collection of ordered pairs (a, y) € R? 


such that y — 2? = 0. As shown above, this is the set of points on the standard parabola 


y = x’ that has its vertex at the origin and is sometimes called the parent graph of 
parabolas. The variety contains infinitely many ordered pairs. This variety has only one 
polynomial but we can include any finite number of polynomials. For more complicated 
systems of equations, there are computer programs available to help us determine a 
solution set which we can interpret as a variety. Technology can also help us represent 


the variety geometrically, particularly if we are working in R? or R®. 


Example 2. Consider V(y—x?, y—«2—2). We want the set of ordered pairs (x, y) € R? 
that satisfy both equations, y — 2? = 0 and y— 2 —2=0. This system can be solved 
using substitution and yields two solutions, (—1,1) and (2,4), representing the points 


where the parabola and line intersect in the Cartesian Plane. So, V(y—2?, y—a2—2) = 


Ge 1), (2, 4)}. 


It is possible for a variety to be empty. In R?, we would have two curves that 
never intersect. We might also want to know if there are other polynomials that could 
be included in our variety from Example 2. In other words, are there other polynomials 
that vanish at (—1,1) and (2,4)? Before considering this question, we will examine some 


properties of affine varieties. 
Lemma 2.8. [CLO15] If V, W Ck” are affine varieties, then so are VOW and VUW. 


Proof. Suppose V = V(f1, fo,.-..,fs) and W = V(g1,92,--.-,gt). To show that the 


intersection of affine varieties is an affine variety, we use double inclusion to show that 


Wet "V fig eos op ine GE): 


First, we will show VOW C V(fi,..., fs, 91,---, gt). Let’s assume (a1,a@2,...,@n) € 
VOW. This means that (a1,a2,...,@n) € V and (a1, a2,...,@,) € W. In other words, 
fi(a1,a2,...,@n) = 0 for all 1 <i < 8 and gj(a1, a2,...,an) = 0 for all 1 < j <t. So, 
(G5 Ae cok Pa Ge VC foes fas OT o0be 08) 

Now, we show V(f1,...,fs,91,---,91) GC VAW. Assume (a1, a2,...,@n) € 
V(fi,---,fs,91,---,gt). This means f;(a1,a2,...,@n) =0 for alll <i< sand 
gj (a1, 42,...,4n) = O for all 1 < j < t. In other words, the n-tuple (a1, a2,...,an) € V 
and (a1, @2,...,@n) € W. Thus (aj, @2,...,@n) € VM W. We have shown that VM W 
= V(fi,..-, fs, 91,;---,9t) which implies VN W is an affine variety. 

For the second part of the lemma, the union of affine varieties is an affine variety, 


we will use double inclusion to show 


VUWEN (figpl1 Sos 6, Lap St). 


First we will show VUW C V(fig;). Let’s assume (a1, 42,...,@n) € VUW. Then 
(@1,@2,---,@n) € V or (a1, @2,...,an) € W. If (a1, a2,...,an) € V, 
then fi(a1,a2,...,@n) = 0 for all 1 <i <_s. So the product 


Figi(i0052045.Gn) = fi(01; 095<%:..5.0 93 (G1; 2/00, Gp) 
= OG 701500 5.2.40n) 
= 0 


for all 1 < j < t. So, (a1, a2,...,an) € V(fig;). On the other hand, if (a1,a2,...,an) € 
W, then g;(a1, a2,...,@n) = 0 for all 1 < j < t. So the product 


)gj(a1, a2, tee , An) 


)-0 


PG GIO 225 Oe) = Fi Giytdg s8 gy 
=F Wis Wd, aain On 


for all 1 <i<-s. So, (a1,a2,...,@n) € V(fig;). 
Now we show V(fig;) C VUW. Assume (a1, 42,..-,@n) € V(fig;). Then 
figj(@1, 42,.--,4n) = fi(ai, a2,.-.,An)9j (a1, 42,.-.,An) = 0 for all i and j. If 
(a1, @2,---,@n) € V, then (a1,a@2,...,@n,) € V UW and the proof is complete. If 
(a1,@2,...,@n) ¢ V then then exists some ig € {1,...,s} such that fj, (a1, a2,...,G@n) £0. 
But we have, for all j, fi, (@1, @2,.-.,@n)gj(@1, @2,---,@n) = fig gj(@1, 42,..-,4n) = 0 since 
(a1, 42,...,4n) € V(figj). Therefore, gj(a1,@2,-..,@n) = 0 for all j. In other words, 
(a1, 2,...,@,) € W, which means (a1, a@2,...,an) €VUW. Thus V(fig;) C VUW and 
V UW is an affine variety. 


The previous lemma can be extended to state that finite intersections and finite 
unions of affine varieties yield affine varieties. The proof would make use of the lemma 
repeatedly and induction on the number of varieties. For example, suppose V, W, X € k” 
are affine varieties. Let Z = V UW. Then by Lemma 2.8, Z € k” is an affine variety. 
Likewise, ZU X is an affine variety, and so forth. Hence the finite union of affine varieties 
is an affine variety. 

Similarly, lett Y = VW. Then by Lemma 2.8, Y € k” is an affine variety. 
Likewise, Y MX is an affine variety, and so forth. Hence the finite intersection of affine 
varieties is an affine variety. These properties will be useful when we determine a Groebner 


basis for an affine variety. 


2.3 Ideals 


The foundational algebraic object we need in our study of varieties is an ideal, a 
special kind of subset that contains the zero element and is closed under addition. Ideals 
are also closed under multiplication of an element in the ideal by an element in the ring. 


Ideals will give us the structure for computing with affine varieties. 
Definition 2.9. [CLO15] A subset J C k[x1,x2,...,Xp] is an ideal if it satisfies: 
(i) OE T. 
(ii) If f, ge J, then f+gel. 
(iii) If fe f and h € k[zi,20,..., 2], then hf € I. 
So, an ideal of a ring is a subring that “absorbs” elements of the ring. [Gal17] 


Example 3. Consider the ring of integers Z and the subring of even integers 2Z. Since 
0 € 2Z, the sum of two even integers is even and the product of 2 and any integer is even, 


we can say 2Z = (2) is an ideal of Z generated by the integer 2. 
Next we define an ideal generated by a finite number of polynomials. 


Definition 2.10. [CLO15] Let fi, fo,...,f; be polynomials in k[x1,x2,..., 2%]. Then 


we define 


(fis fas---5 fs) = Shahi | Ai, ha,...,hs € Kenta tall 
i=1 


We call (fi, fo,..., fs) the ideal generated by fi, fo,...,f,; and the set of defining equa- 
tions, {f1, fo,..., fs} is called the basis of the ideal. If the ideal is generated by a single 


element, we call it a principal ideal. That is, 
(f) = {hf | he Rit, Bo. eat 
is called the principal ideal generated by f. 


Example 4. In R{z], the ring of polynomials with real coefficients, the ideal, (x) = 
{h-a|h € k[x1,22,...,2%n]} is the subset of polynomials with real coefficients whose 
constant term is zero. The graph of these polynomials pass through the origin. Since this 


ideal is generated by a single polynomial, it is a principal ideal. 


10 


Example 5. Though the ring of integers is not a field, let’s consider the ring of polyno- 
mials whose coefficients are integers. One ideal of this ring is the subring of polynomials 
with constant terms that are even integers. That is, J = (#,2) = {hi-x%+h2-2|hi,h2€ 


Z|21, £2, BAe , En) }. 


The proof of the following lemma justifies that we call (f1, fo,..., fs), defined 


above, an ideal. 


Lemma 2.11. [CLO15] Jf fi, fo,...,fs € kla1,22,...,2n], then (fi, fo,...,fs) is an 


ideal of k)1,;29, x25 2a: 


Ss 
Proof. We will verify that (fi, fo,..., fs) = S> hada. | Rigea es big G Bley Woy. cee eas 
i=1 
an ideal by showing it satisfies the conditions set forth in Definition 2.9. The first condition 


Ss 
is satisfied by noticing that 0 € k[x1,22,...,2p] and 0 = S00: fi. So, 0 € (fi, fo,.--5 fs)- 
i=1 
Now suppose f,g € I and pj, po,.--;Ps,Q1;92;---59s © Kla1,22,...,%p] such that f = 
s s Ss s Ss 


So pifi and g = Safi. Then f +9 = PRs + So afi = So (pi + qi)f;i € I. Finally, 
=i =i =i j=1 jai 


for hi ikl@1,@o5.2032¢m|) hf = nS? pifi — So (hpi) fi € I. Since all conditions are 
i=l i=l 


satisfied, I = (fi, fo,..., fs) is an ideal. 


The proof confirms that if a polynomial can be written as a linear combination 
of fi, fo,...,fs, then it is a member of the ideal (fi, fo,...,f;). This relationship is 
useful when comparing ideals. Let IJ be an ideal such that I C k{a1,22,...,2,] and let 


fi, fa,---y fs © kai, r2,...,2n]. Then the following statements are equivalent: 
(i) fi, fas---5fs € I. 
(ii) Chis Force ot ite) he a 


Suppose, for example, we want to show (x,y) = (a — y, x+y). We have 


a= 5(at+y)+5(x—-y) € (x+y, r—y) and y = 3(x+y)—3(x—-y) € (x+y, x—y). Thus, 


(x,y) C (x+y, x—y). Likewise, x+y = 1-4+1-y € (x,y) anda—y = 1-4-1-y€ (a, y). 


Thus (r+y, xz — y) c (x,y). So (x,y) = (oe y; £— 4). 
We further explore the role of ideals with a proposition that states a variety 


depends on the ideal generated by its defining equations. 
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Proposition 2.12. [CLO15] Jf fi, fo,..., fs; and 91, 92,---, 9 are bases of the same ideal 
ER E0952 33 aly SO WNGE Pig fase ae Fa) = Gis Ga5a224Oy) THER: V is fos ots = 
V(91, 92; --+59t): 


Proof. We have (fi, fo,..-,fs) = (91, 92;---,9t). We will show that V(fi, fo,...,fs) = 
V(91,92;---,9¢) by double inclusion. Let the n-tuple (a1, a2,...,@n) € W(fi, fa,..-, fs). 
Then f1(@1,@2,...,@n) = 0, fo(ai,a2,...,@n) = 0,..., fs(@1,@2,.--,@n) = 0. In other 
words, for all 1 <i < s, f; vanishes at (a1,a2,...,@,). Consider (hifi + hofg +---+ 
hsfs)(@1,@2,---,@n) for some hy, ho,...,hs € klxy,x2,...,¢n]. Since (a1, a@2,...,@n) is 
a point where all f; vanish, (hifi + hofo +--: + hsfs)(a1,02,-.-,@n) = 0. But since 
(fi, fa,--->fs) = (91,92,---,9t), there exist j1,jo,...,j¢ € klx1,22,...,2n] such that 
hi fithafet-:-t+hsfs = jigitjagat::-+Jjege. So (j1gitjogat-+-+5egt) (a1, 42,--+,4n) = 
0. Thus V(g1, 92,---;94) © W(fi, fo,---5 fs). 

Similarly, let the n-tuple (61, be,...,bn) € W(gi, 92,---, gt). Then for all 1 <i< 
t, g; vanishes at (bi, b2,...,bn). Consider (j191 + jogo +--+ + Jegt)(b1, b2,..-, bn) for some 
Ji J2,--+y Jt € klai, v2,...,XLn]. Since (b1, b2,...,bn) is a point where all g; vanish (j1g1 + 
jogo +++++Jege)(b1, b2,---,6n) = 0. But hifithefet+---+hsfs = jigitjogat- +: +59, 
so (hifi thafot-::thsfs)(b1, b2,---,bn) =0. Thus V(fi, fo,.--,fs) © Wg, g2,---5 9). 
Hence, if (fi, fo,.--, fs) = (91, 92,---) gt), then V(fi, fo,.--, fs) = W(g1, 92,---59t)- 


The ability to change the basis of an ideal without affecting the variety is a 
powerful tool in determining varieties. 
Next we will study another class of ideals, ideals of varieties. The ideal of a 


variety consists of all the polynomials that vanish on a given variety. 
Definition 2.13. [CLO15] Let V C k” be an affine variety. Then we set 
WV) S4F-6 klais toy es te) | Feiss, eG Oy) = 0 forall (Ws 005.0200) :E VI}: 
The proof of the following lemma justifies that we call I(V) an ideal. 


Lemma 2.14. [CLO15] If V C k” is an affine variety, then I(V) C k[a1, 22,...,2n] is 
an ideal. We will call I(V) the ideal of V. 


Proof. We will show that I(V) satisfies the three conditions set forth in the definition 
of an ideal. The zero polynomial vanishes on all n-tuples, so in particular it vanishes 


on V. Thus 0 € I(V). Let (a1,a2,...,@n) be an arbitrary element of the variety V. 
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Consider polynomials f,g € I(V). Then by Definition 2.13, f(a1,a2,...,@,) = 0 and 


g(a1, @2,.--,An) = 0. So the sum, 


(f + 9)(a1, @2,---;@n) — f (a1, a2,...,@n) + g(a1,a2,...,An) 


Hence, f +g € I(V). Finally, let f € 1(V) and h € kia, x2,..., 27]. Then the product 


Chef) aig Ga, Ste) 


(Oy tis egy FT (Gila cn OR) 


Hence, hf € I(V). Since all conditions are satisfied, I(V) is an ideal. 


We see that we can construct an ideal by starting with a set of polynomials and 
generating an ideal directly from them or we can consider the set of points on which the 
polynomials vanish and construct an ideal of the variety. It is natural to ask if the ideal of 
the variety is equivalent to the ideal generated by the polynomials. The following lemma 


addresses this question. 


Lemma 2.15. [CLO15] Let fi, fo,...,fs € kla1,22,...,2n]. Then (fi, fo,..., fs) © 
I(V(fi, fo,.--,fs)), although equality need not occur. 


Proof. Let f € (fi, fa,..., fs). Let (a1, @2,...,@n) be an arbitrary element of 
V(fi, fa,.--,fs). We will show f € I(V). If f € (fi, fa,...,fs) then f = So hifi for 


i=l 
some polynomials hy, h2,...,hs € klx1,%2,...,2n]. Then 
f(a1,@2,...,@n) = hifi(ar,...,@n) + hofe(ai,...,@n) +--+: +hsfs(a1,..-,@n) 
= hy (Gye 4 Oa) fi late. s4 On) + Ne (01,224) Gy) Fala 5y) 
+---+hs(a1,...,@n)fs(@1,---,@n) 
= hy(a1,...,@n)-O+ ho(ai,...,€n)-O+---+hs(ai,...,an)-0 
=0. 


So, f € I(V) and (fi, fo,..., fs) © ICV (fi, fo,..-, fs)). Now to show that equality 
need not occur, we present an example for which (f1, fo,..., fs) C I(W(fi, fo, .--, fs)) 
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but I(V(fi, fa,---,fs)) Z (fh, fa,..-,fs). We use the first part of the proof. Consider 
(x?, y?) CI(V(a?, y”)). We can compute I(V(2x?, y?)). Solving the equations 7? = 0 and 
y” = 0 implies V(2?, y?) = {(0,0)}, which is the origin in k?. Then its ideal I({(0,0)}) is 
the set of polynomials that vanish at the origin, that is I(V(x?, y?)) = (a, y). We know 


x ¢ (x?, y*) since any linear combination of x? and y? will consist of monomials having 


total degree at least two. Hence (x, y) ¢ (x?, y”) and so the ideals are not equal. 
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Chapter 3 


Groebner Bases 


To apply algebraic methods to solve geometric theorems such as the ones that 
will be presented in this thesis, we will express geometric structures as systems of poly- 
nonals: f(s wos enry CA) = Oy Pol ei eeases Bay = Ol erg fe Cees eee) =. Oand 
determine the solutions. This is restated geometrically as finding the n-tuples that are 
elements of the affine variety V(f1, fo,..., fs). We showed in the last chapter that the 
variety depends on the ideals generated by the defining polynomials. But we also showed 
that there is more than one basis for an ideal. Deriving a convenient basis is the key to 
solving our system. Groebner bases have the features we desire and in this chapter we 


define and understand the need for Groebner bases. 


3.1 Orderings on the Monomials in k{21, %2,..., Zp! 


We have defined the polynomials we will employ in our task of solving geometric 
theorems algebraically. We now turn to the dilemma mentioned in Section 2.1. When 
working with polynomials in more than one variable, the terms may be arranged in many 


ways. We define three commonly used lexicographic orderings in this section. 


Definition 3.1. (Lex Order). [CLO15] Let a = (a1,...,a@n) and 6 = (b1,...,bn) € 
Zo. We say a >iex f if, in the vector difference a — 6 € Z”, the leftmost nonzero entry 


is positive. We will write 7° >/., 2° if a >Iex B. 


Definition 3.2. (Graded Lex Order). [CLO15] Let a = (a1,a2,...,a,) and 6 = 
(61, b2,..-, bn) € ZSo. We say a >griex B if 
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n n 
la] = >a; > [6] = 52 Gi, or |a| = |B] and a > ter 8. 
i=1 i=1 


Definition 3.3. (Graded Reverse Lex Order). [CLO15] Let a = (a1, a2,...,a@,) and 
B = (01, 62,...,bn) € ZS. We say a >greviex 2 if 


n n 
la| = So ai > (Bl = eer or |a| = |G] and the rightmost 
i=1 i=1 
nonzero entry of a — 6 € Z” is negative. 


We return to our polynomial example from Section 2.1, f(x,y, 2) = 3a3y?z + 
223 — 7x* + 9x7y* to demonstrate the difference in these three monomial orderings. The 
variables will have the order x > y > z. The orderings depend on the exponents of 
the monomials so we first express the exponents of each term as an n-tuple. Let a = 
(3,2,1) represent the term 3x%y?z. We calculate the total degree of this monomial, 
la] =3+2+1=6. For the term 22%, we have 6 = (0,0,3) with |8| = 3. For the term 
—Tx*, we have y = (4,0,0) and |y| = 4. For the term 9x7y*, we have 6 = (2,4,0) and 
|d| = 6. 

To arrange the terms in lex order, we compare the differences between the or- 
dered triples. We look to the leftmost nonzero entry and arrange the triples in descending 
order based on these entries. Since the first entry in y is larger than the first entry in 
the other triples, we conclude that the term —7x* will come first in the lex ordering. 


We see that 6 — a = (—1,2,—1) but a— 6 = (1,—2,1). Comparing the leftmost entries 


informs us that 3a°y?z is next. Since the leftmost entry of 6 is less than the leftmost 


5 
6 


f(x,y, z) = —Tat + 33 y2z + 9x2y4 + 223. We see that in lex order, we do not consider 


entry of the other triples, we list the term 2z° last. So, our polynomial in lex order is 
the total degree of the monomials. Our comparison is based only on the exponent of the 
first variable that has different exponents. This is the ordering one would see in a typical 
secondary algebra course. 

In some cases, it may be advantageous to list the monomial with larger total 
degree first. In grlex ordering, monomials are listed in descending order by total degree. 
If two monomials have the same total degree, then we use lex ordering to decide which 
of those monomials is listed first. With our example, since |a| = |6| = 6 and this total 
degree is greater than that of the other two monomials, we find the n-tuple differences to 
determine which term will hold first and second positions. The leftmost entry in a — 6 is 
1 but the leftmost entry of 6 — a is —1, thus a > 6 in grlex ordering. Next is y followed 


by 6. So, in grlex order, our polynomial is f(x,y, z) = 3x°y?z + 9a7y* — 7x4 + 22°. 
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Now we write our polynomial in grevlex order. As with grlex order, our first 
comparison is with the total degrees. However, when two or more monomials have the 
same total degree, we look at the rightmost nonzero entry of the n-tuple differences. As 
above, we need to decide if 9x?y* or 3x?y?z will be placed first. The rightmost entry in 
a— 0 is 1 and the rightmost entry of 6 — a is —1. According to grevlex order, 6 > a 
and our polynomial is f(x, y,z) = 9a7y* + 323 y?z — 7x* + Tae Once we have selected a 
monomial ordering, we can express a polynomial without ambiguity, which is especially 


helpful in addressing the following terminology. 
Definition 3.4. [CLO15] Let f = 3 Aqx* be a nonzero polynomial in k[x1,2%2,..., np] 
and let > be a monomial ordering. 


1. The multidegree of f is 
multideg(f) = max(a € ZS : da # 0). 
The maximum is taken with respect to the monomial ordering, > 
2. The leading coefficient of f is 
LC(f) = Gmuttiaeg(f) © *- 
3. The leading monomial of f is 
LM(f) = e™ultides(s) 
(with coefficient 1). 
4. The leading term of f is 
LT(f) = LC(f)- LM(f). 


We return again to our polynomial and illustrate the above terminology. With 
respect to graded lex order, f(x,y, z) = 3x3y?z + 9x7y* — 7x4 4 2 23, Then 


multideg(f) = - 1 
LC(f) = 
LM(f) = 
Lr( f= oo 


Identifying the leading term of polynomials is necessary when we want to work 


systematically with a system of polynomials. 
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3.2 Division Algorithms 


One systematic way we work with polynomials is to determine if one polynomial 
can be expressed as a product of other polynomials. When working with polynomials of 
one variable, we may use a division algorithm much like the division algorithm for integers 
one learns in primary school. Here we present a formal theorem describing the Division 


Algorithm for Integers from a Number Theory text. 


Theorem 3.5. (Division Algorithm for Integers). [Burll] Given integers a and b, 


with b #0, there exist unique integers q and r satisfying 
a=q:b+r 0<r< dj. 


The integers q and r are called, respectively, the quotient and remainder in the division 
of a by b. 


Consider, for example, the integers —47 and 5. We can write —47 = (—10)(5)+3. 
We could also write 5 = (—47)(0) + 5. We typically take the integer with the smaller 
absolute value as the divisor and take special notice of pairs of integers for which this 
division algorithm results in a remainder of zero, such as with a = 32 and b= 8. We can 
write 32 =4-8+0. We say 4 and 8 are factors of 32 and 32 is a multiple of 4. 

Now a division algorithm for polynomials in k[z] that is analogous to the algo- 


rithm for integers is given by the following: 


Theorem 3.6. (Division Algorithm for k/z]). [Gall7] Let k be a field and let f(x), 

g(x) € kia] with g(x) #4 0. Then there exist unique polynomials q(x) and r(x) in k[a] 
such that f(x) = q(x) - g(x) + r(x) and either r(x) = 0 or the degree of r(x) is less than 
the degree of g(x). 


The polynomials g(a) and r(x) are called the quotient and remainder in the 
division of f(x) by g(x). When the ring of coefficients of a polynomial ring is a field, we 
can use the long division process to determine the quotient and remainder, as shown in 


the examples below. 


Example 6. Let f(x) = 324 + 223 — 2? — x — 6 and g(x) = x7 + 1 be polynomials in 
R{z]. Then we can divide f(x) by g(x) as follows, 


ge+1 3x4 + 2x3 x xr—6 


— 324 — 3a? 


Dge = Ag vag 


— 27 — 2 
— Ay? — 32 —6 
Ax? +4 
—3r-—2 


Now we have q(x) = 3x7 + 22 — 4 and r(x) = —3a — 2 so we can write 
f(z) = 304 + 223 — 2? — 2 —6 = (3a? + 2a — 4)(2? +1) + (—3a — 2). 
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With polynomials in a single variable, we note that lex, grlex and grevlex all 


result in the same ordering. As with integer division, we are interested in cases when 


polynomial division yields a remainder of zero as in the next example. 


Example 7. Consider f(x) = x? — 8, g(x) = 2 — 2 € R{z]. We can divide f(x) by g(x) 


as follows, 

a? +2x+4 
g—-2| 2 —8 

— x3 + 22? 

Da? 

— Ix? + dex 

4x —8 
—4x+8 
0 


We can write 2° — 8 = (a7 + 2x + 4)(x — 2) and say 2? + 24 + 4 and x — 2 are 


factors of x? — 8. Furthermore, we can say x? — 8 € (x — 2), that is, 23 — 8 is in the ideal 


generated by x — 2. 


An important feature of division with polynomials with one variable is that 


the quotient and remainder are unique. This is not generally true when we work with 


multivariable polynomials. We notice in the next theorem that a monomial ordering must 


be specified. We will see in the examples that follow that when we change monomial 


ordering, we may get a different remainder. We can also generate different remainders by 


changing the order of the divisors. 
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Theorem 3.7. (Division Algorithm in k[x1,72,...,2,]). [CLO15] Let > be a mono- 
mial order on Z¥o and let F = (fi, fo,.--,fs) be an ordered s-tuple of polynomials in 


k|av1,22,...,%n]. Then every f € k[x1,22,...,2%n] can be written as 


f=ahtefet:::+asfstr, 


where qT © klx1,%2,...,%n] and either r = 0 or r is a linear combination with coeffi- 
cients in k, of monomials, none of which is divisible by any of LT(f,), LT(f2),..., LT (fs). 


We call r a remainder on division by F. Furthermore, if qfi 40, 
multideg(f) > multideg(q fi). 


Example 8. [CLO15] In this example, posed as an exercise in the Cox, Little, O’Shea 
text, we demonstrate the division algorithm for multivariable polynomials. We compute 
the remainder on division of f = a’ y?+a%y?—y+1 by the ordered set F = (xy?—a, r—y?). 
These polynomials are in R{x, y]. We will use grlex order with x > y. Our polynomial f is 
already in grlex order and our divisors in grlex order are f; = cy?—x and fz = —y?+a. We 
will use the graphical representation presented in The Teaching of Mathematics Journal 
[Zeal13]. 


If either LT(f,) or LT(f2) divided 2’, we would repeat the procedure. When a monomial 
ordering is selected, the algorithm always terminates because the multidegree of f is 
reduced in each iteration. In this case we obtain g; = x° + x”, gg = 0 and the remainder, 


r=z'+2?—y+1 and we have f = (x° + 2”) (xy? — 2) + 0(-y? +. 2) + (27 +232 —y+1). 


Example 9. We will work with the same polynomial f and ordered set F' from Example 
8 but this time we will use lex order. The arrangement of the terms of f and f; remain 
f=ax'y?+a3y?-—y+l1 and f; = ry?—2x but fo changes to fo = x—y?. We can anticipate 
a change early in the division procedure since LT(f2) = 2 divides x? so a” will not 
become part of the remainder. The procedure with the lex ordering required 14 iterations 
completed by hand and resulted with q, = 2° + v°y + a4 + ay + xy? + Qe + 2y, qo = 


e+ a°y+a*+aoy+ 2x4 2y, and r = 2y4—y+1. When working with a large number 
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of variables and divisors, computer programs can efficiently produce the quotients and 


remainders. 


The significance of these examples is that when we change the orderings on the 
monomials, the division algorithm yields different remainders. Moreover, if we reorder 
the set F’, we have the potential to yield different remainders. When using lex order and 


using the ordered set F' = (f2, fi) the procedure involved eleven iterations resulting in 
11 


eleven terms in q, with remainder r = —y“? + y y+1 but using this arrangement on 
the set F with grlex order, we obtain the same remainder as in Example 8. 

Since none of our division procedures resulted in a remainder of zero, we may 
be tempted to conclude f ¢ (fi, fo) but we have not exhausted all possible monomial 
orderings. Our next example shows that despite one ordering resulting in a nonzero 


remainder, we cannot conclude the polynomial is not a member of the ideal. 


Example 10. [Zeal3] We will calculate the remainder on division of f = x?y? — 2ry? € 
R[x, y] by the ordered set F = (y? +4, 2?y — 2x) € R[x, y]. We will use lex order with 


x > y as follows, 


(a?y? — 2ey*) : (y> +4) = 2” 
—(2?y? + 42") 
~2Qx7y — 4x? : (x?y — 2x) = 0 


So, we obtain x?y? — 2ry? = 2?(y? + 4) + 0(2?y — 2x) — 2xy? — 427. Suppose, however, 


we reorder the divisors and take F' = (f2, fi). Then we have 


(x?y? — Qay?) + (a?y — 2x) = y? 
—(x?y? — 2ry”) 
0: (y?+4)=0 


This time, we obtain x7y? —2ay? = y?(x?y—2x)+0(y2+4)+0. We observe that 
f, f1, fo are unchanged regardless of which of the three monomial orderings we selected 
but the two cases produced different remainders due to the change in ordering of the 


divisors. 


21 


Hence, we find that f € (f1, fo) despite having a nonzero remainder when di- 
viding f by the ordered set F = (f1, f2). The need for a uniquely defined remainder on 


division is one of the motivations for the definition of the Groebner basis. 


3.3 The Hilbert Basis Theorem and Groebner Bases 


Deciding if a polynomial is in an ideal is referred to as the Ideal Membership 
Problem. In polynomial rings in one variable, the division algorithm uniquely determines 
the remainder and a remainder of zero tells us that a polynomial is a member of an 
ideal. When working with multivariable polynomials, we may obtain a remainder of 
zero with a particular monomial ordering or divisor order and that would be sufficient 
to determine that the polynomial is a member of the ideal. The dilemma is there are 
numerous orderings. The key, then, is to choose divisors which will be generators of 
the ideal that uniquely determine the remainder, regardless of the order. Based on our 
examples, it is reasonable to expect the leading terms to play a critical role in the selection 


of such divisors. This set of divisors is called a Groebner basis. 


Definition 3.8. [CLO15] An ideal I is a monomial ideal if there is a subset A C Z3o 
such that I consists of all polynomials which are finite sums of the form )> hax, where 
he € k[x1,x2,...,2%n]. In this case, we write I = (x* | aE Ac Z3): ace 

In other words, a monomial ideal is an ideal generated by monomials. If a 
monomial «°° lies in a monomial ideal J, then x? must be divisible by x% for some a € A. 
If a polynomial f is a member of a monomial ideal J then each term of f lies in J and f 
is a linear combination of the monomials in J. Another important feature of monomial 


ideals in k[x1,x2,...,2%p] is that they are finitely generated. 


Lemma 3.9. (Dickson’s Lemma). [CLO15] A monomial ideal I = (x* | a € A) C 
k[x1,22,..-,2£n] can be written in the form I = (x™, 7%,..., 2%), where a1, 02,...,Qs5 € 


A. In particular, I has a finite basis. 


Definition 3.10. [CLO15] Let I C k{x1,x2,...,2,] be an ideal other than {0} and fix a 


monomial ordering on the monomials of k[xv1,22,..., Xn]. 


(i) We denote by LT(/) the set of leading terms of elements of J. Thus, LT(I) = 
{cx® | there exists f € I with LT(f) = ca}. 


(ii) We denote (LT(/)) the ideal generated by the elements of LT(J). 
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It is important to recognize that if the ideal J is generated by a finite set of 
polynomials fi, fo,..., fs, then (LT(fi), LT(f2),..., LT(fs)) does not necessarily equal 
(LT(Z)). It follows from the definition that since LT(/) C (LT(Z)) then 
(LT(f1), LT(f2),...,LT(fs)) CG (LT(U)). In the following example we will show that 
(LT(Z)) Z (LT (fi), LT(fe),..., LT (fs)) by finding a member of (LT(J)) that is not a 
member of (LT(f1), LT(f2),...,LT(fs)). 


Example 11. Suppose J = (f1, fo) C R[x, y] where f; = 2ry? — a and fo = 3x2y—y—1 


and use egrlex ordering with « > y. Then 


fz = 3a(2xy” — x) — 2y(8a7y — y — 1) = —3a° + 2Qy? + Qy. 


So, f3 € I and LT(f3) = —3x2? € LT(I). However, since —3x? is not divisible by either 
2Qry? or 3x2y, the leading terms of f; and fo, nor is it a linear combination of these 
monomials, then —3x° ¢ (LT(f1), LT(f2)). 


Proposition 3.11. [CLO15] Let I Cc kia, x2,..., 2%] be an ideal. 
(i) (LT(Z)) ts a monomial ideal. 
(ii) There are gi,92,.--,9: € I such that (LT(Z)) = (LT(g1), LT(g2),..., LT (g)). 


Proof. (i) By Definition 3.4, for polynomial f = So qx € k[x1,22,...,2%n], LC(f) € 


koC Rie, 2oy..@,) and LIC) = LC) = LM(f). By Definition 3.8, the collection of 
leading monomials, LM(f) for f € I \{0}, generate the monomial ideal 

(LM(f) | f € Z \{O}) and since LT(f) and LM(f) differ only by a nonzero constant, 
(LM(f) | f € I \{O}) = (LT(f) | f € I \{O}) which, by Definition 3.10 equals (LT(J)). 
Thus, (LT(J)) is a monomial ideal. 

(ii) By part (i), (LT(Z)) is a monomial ideal generated by LM(f) for f € I \{0}. Dickson’s 
Lemma tells us that a monomial ideal has finitely many generators. That is, monomial 
ideal J = (2, 7, ..., 2°), where a1, @2,...,a¢ € AC Z8p. So, for 91, 92,..., 94 € I, we 
can write (LT(I)) = (LM(g1), LM(g2),..., LM(ge)) = (LT(g1), LT (ga), .--, LT (ge). 


We have shown that monomial ideals can be finitely generated. The next theo- 


rem extends the feature to all ideals. 


Theorem 3.12. (Hilbert Basis Theorem). [CLO15] Every ideal I C k[x1,22,...,2n] 
has a finite generating set. That is, I = (g1,92,-.--, gt) for some gi, g2,---,g¢ € I. 
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Proof. If I = {0} the ideal is generated by the finite set {0}. If, however, J 4 {0}, we can 
specify a monomial ordering on k[x1,22,...,2»] and consider the ideal of leading terms 
of I, (LT(Z)). Then we can employ Proposition 3.11 to construct a finite generating set 
gi, 92,---, 9 € I such that (LT(g1),..., LT (g:)) = (LT()). 

We have gi, 92,---,9: € I such that (LT(g1),...,LT(g:)) = (LT(J)). We will 
show that I = (91, g2,---, 9+) by double inclusion. Since g; € J, 1 <i <t, (91, 92,---594) C 
I. To show that I C (91, 92,---; 94), we let f be any polynomial in the ideal and show that 
f is in (g1,92,---,g). Since f € I, we apply the Division Algorithm for multivariable 
polynomials given in Theorem 3.7 and show that the remainder must be zero. Using the 


selected monomial ordering, dividing f by the ordered set of divisors (91, g2,.--, 94) yields 
f=aigi + a2g2 ++: +age +r 


where no term of r is divisible by any of LT(g;), LT(g2),..., LT (g:). Suppose r 4 0. Our 


result from the division algorithm can be written 


r= f —aigi — age —--- — arge 


which shows r is a linear combination of the members of J. So, r € J which means 
LT(r) € (LT(J)). We have (LT(g1), LT(g2),..., LT(g:)) = (LT(Z)) which implies LT(r) 
is divisible by some LT(g;). This contradicts the definition of a remainder found in our 
division algorithm. We conclude that r = 0, thus 


f = ag) + a2g2 +--+ + a4gt. 


This means f € (g1,92,---,g9t) which shows that I C (g1,92,..-,gt). It follows that 


l= (91,92) +++59t): 


We saw in Example 11 that in general, for an ideal generated by a finite set of 
polynomials gj, g2,..., 9, (LT(Z)) need not equal (LT(g1), LT (g2),..., LT (g:)). However, 
in the proof of the Hilbert Basis Theorem, we constructed an ideal with this property. 
The type of finite generating set, or basis, used is called a Groebner basis and is our next 


topic. 


Definition 3.13. [CLO15] Fix a monomial order. A finite subset G = {91, 92,.--, 9} of 


an ideal I is said to be a Groebner Basis if 


(LT (91), LT (ga), .--,LT(g)) = (LT(Z)). 


The next corollary follows from applying the Hilbert Basis Theorem to (LT(J)). 
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Corollary 3.14. [CLO15] Fiz a monomial order. Then every ideal I C k[x1,22,...,2n] 
other than {0} has a Groebner basis. Furthermore, any Groebner basis for an ideal is a 
basis of I. 


Before describing additional properties of Groebner bases and describing how 
to obtain Groebner bases, we note another consequence, a geometric consequence, of the 
Hilbert Basis Theorem. We recall that affine varieties were described as solution sets of 
systems of polynomial equations. Specifically, from Definition 2.7, 

MU ipfoysoey ts = (Gas 5,255.04) BK” 73 Gi, Gos,<30,,) = 0 for all le eh 
The following definition and proposition allow us to think about affine varieties in the 


context of ideals. 


Definition 3.15. [CLO15] Let J C k[x1,22,...,%n] be an ideal. We will denote by V(J) 
the set 


V2) ={ (Gig... ,4n) ©" | F (Gi, 3.5, G,) = O for all fe I}. 


Proposition 3.16. [CLO15] V(J) is an affine variety. In particular, if I = (fi,..., fs), 
then V(I) = V(fi,.--5fs)- 


Proof. Let I be any ideal of k[az1,...,2,]. Then by the Hilbert Basis Theorem, J has a fi- 
nite generating set. In other words, I = (fi,..., fs) for some f; € I. We will show V(J) = 
V(fi, fa,..-, fs) by double inclusion. By Definition 3.15, V(J) is the collection of n-tuples 
that satisfy the polynomial equations f = 0 for all f € J and hence satisfies the polyno- 
mial equations f; = 0 for 1 <i<s since fi,...,fs € I. Let (a1,a2,...,an) be such an 
n-tuple.. Then. f) (01) Oise On) = 0, Foldr, Gos c2-5Gy) = Opscoy: Sel Qiseayeiss a) = 0: 
Thus V(J) c V(fi, fa,.--, fs). To show V(fi, fo,.-., fs) C W(L), we let (a1, a2,..., an) 
be any n-tuple in V(fi, fo,..., fs) and show that (a1, a@2,...,@n) € V(1). For any f € J, 


he = hifi + ho fo +.--- +hefs where h; € Fleece tetas 
Since fi, fo,...,fs € I, we have 
F(@15025.2040.60n) = Ra O1g ss i Ga VF 1015 s<. On) be hay Oy) Fe (Giger, Gn) 


= hy(a1,...,€n)-O+---+hs(a1,...,Gn)-0 
= 0. 


Therefore, (a1, a@2,...,@n) € V(Z), which implies V(f1, fo,...,f;) C WU). So we con- 
clude V(I) = Vi fi. fe, shag mee 
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Now we have established that varieties are determined by ideals of polynomial 


rings. This is an important link between algebra and geometry. 


3.4 Properties of Groebner Bases 


We identified the need for uniquely determined remainders as a motivation for 
the definition of Groebner bases in Section 3.2. Now we will explore how that definition 
has the quality that the division of a polynomial f by a Groebner basis G yields the same 
remainder regardless of the ordering on the members of the basis. We showed that every 
nonzero ideal has a Groebner basis, so we will be able to determine if a polynomial is a 
member of an ideal by dividing by a Groebner basis and checking to see if the remainder 
is zero. We will not need to wonder if rearranging the divisors will give us a different 


remainder. 


Proposition 3.17. [CLO15] Let G = {g1, g2,..., 94} be a Groebner basis for an ideal IC 
k[ay,22,...,%n] and let f € k[ay,x2,...,¢n]. Then there is a unique r € k[x1,22,...,2n] 


with the following two properties: 
(i) No term of r is divisible by any of LT(gi),..., LT (gz). 
(ii) There is g € I such that f=g+r. 


In particular, ris the remainder on division of f by G no matter how elements of G are 


listed when using the division algorithm. 


Proof. For the set of divisors, G = {g1, g2,..-, 9} C k[x1, 22,...,%n], the Division Algo- 


rithm in k[x,,22,...,%n], states that a polynomial f € k[z1,72,..., 2p], can be written 


f=91+ G92 +++: +49 +7 


with r having precisely the property listed in (i) and qi, q2,...,q@ € k[x1, 22,...,%n]. This 
implies g = q1g1 + q2g2 +--- +9: € I which satisfies (ii). This proves the existence of r. 
We prove that r is unique by contradiction. Suppose r and r’ both satisfy conditions (i) 
and (ii). That is f = g+r = g'+1’. It follows that r—r’ = g’—g € I which implies, as long 
as r and r’ are different, LT(r — r’) € (LT(r — r’)) = (LT (g1), LT (g2),..., LT (gz)). This 
means LT(r —r’) must be divisible by some member of (LT(J)). This is a contradiction. 
Since r and r’ satisfy condition (i), no term of r or 7’ is divisible by any of LT(g;), 1 <7 < t. 


Therefore, r and r’ are not different. In other words, the remainder on division of f by 


G is unique. 
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Although uniqueness of remainders is significant in our effort to determine if a 
polynomial belongs in an ideal, the quotients produced by the division algorithm need 


not be unique. 


Example 12. [CLO15] This example is posed as an exercise in the Cox, Little, O’Shea 
text. We are given G = {x + z,y — z} is a Groebner basis for I = (x + z,y — z), the 
ideal generated by these polynomials using lex order with « > y > z. We divide the 
polynomial f = ry by the ordered set of divisors G = (a+ z,y—z). Then we divide with 


the order of the divisors reversed. First, 


(zy): (@+z) =y 
—(zy + yz) 

=y7: (y= 2) = =e 
—(—yz + 27) 
2 


Tz 


2 


The remainder is —z? so, we obtain zy = y(x + z) — z(y— z) — 2”. 


Now reversing the order of the divisors, 


(zy) : (y-z) =2 


— (xy — x2) 
Ce (Oey Se 
—(xz +27) 


As expected, since we are dividing by a Groebner bases, the remainder here is also —z?. 


However, the quotients were different producing ry = x(y — z) + z(a@ +2) — 27. 


Corollary 3.18. [CLO15] Let G = {g1,92,...,g:} be a Groebner basis for an ideal 
I Chay, 22,...,2n| and let f € k[xy,22,..., 2p]. Then f € I if and only if the remainder 


on division of f by G is zero. 


Proof. (=) If the remainder on division of f by G is zero, then the Division Algorithm 


produces 


f=M91 + Q292 +++: + Gg +0 
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which means f is a linear combination of gi, g2,...,g:. Therefore, f € I. 


(<=) If f € I is given, then f = f +0 satisfies the two conditions of Proposition 3.17 with 


r = 0 the resulting remainder on the division of f by G. 


This corollary leads us to an algorithm for solving the Ideal Membership Prob- 
lem. In other words, when we satisfy the requirement of having a Groebner basis G' for 
the ideal I, we need only to compute a remainder on division by G to determine if f € I. 
We will determine how Groebner bases are derived in the next section. We will use the 


following notation for the remainder. 


Definition 3.19. [CLO15] We will write f for the remainder on division of f by the 
ordered s-tuple F' = (f1, fo,..., fs). If F is a Groebner basis for (fi, fo,..., fs), then we 


can regard F as a set (without any particular order) by Proposition 3.17. 


Example 13. Using our new notation, we can express the remainder r = —z? from 


Example 12, 77° = —2z?. 


Definition 3.20. [Zeal3] Let f, g € k[x1,22,...,XZn] be nonzero polynomials. 


(i) If multideg(f) = a and multideg(g) = G@, then let y = (71,72,---,%n), where yj = 
max(a;, 6;) for each i. We call x7 the least common multiple of LM(f) and LM(g), 
written «7 = lem(LM(f), LM(g)). 


(ii) The S-polynomial (S stands for “syzygy”, from Latin syzygia “conjunction”, or 


Greek, syzygos “yoke together”) of f and g is the combination 


S(f, g) = Neg a7 rT) ‘g. 


Example 14. [Zeal3] Let f = xt*yz+a7y3z+az and g = 2a7y?z+2y? +22? in Q[z, y, z] 
with lex ordering. We have a = (4,1,1) and 6 = (2,2,1) so y = (4,2,1). We compute 


the S-polynomial as follows: 


4, ,2 4, ,2 
Ue Yy*z LY? Zz 
gS = 
(f,9) eae f Daye 2 
1 2 
=uf— 509 
1 3,2 


1 
= at y sue aytz + cyz. 
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The S-polynomial is constructed in such a way that the leading terms of the 
two polynomials cancel each other. This is something the division algorithm did not 
always produce. “Once a basis contains all the possible S-polynomials of polynomials 
in the ideal generating set, there are no extra polynomials in (LT(/)) that are not in 
(LT (gi), LT (g2),..., LT(gs))”. [Zea13] The use of S-polynomials helps develop a criterion 


for determining if a basis is a Groebner basis, as described in the next theorem. 


Theorem 3.21. (Buchberger’s Criterion). [CLO15] Let I be a polynomial ideal. Then 
a basis G = {91,92,---, 94} for Lis a Groebner basis for I if and only if for all pairs i # j, 


the remainder on division of S(gi,g;) by G (listed in some order) is zero. 


Example 15. We return to the set G = {x + z,y — z} given in Example 12 to be a 
Groebner basis of I = (x + z,y — z) using lex ordering x > y > z. Now we will prove 
the set is a Groebner basis using Buchberger’s Criterion. We begin by computing the 
S-polynomial S(g1, gz). Let a = multideg(g1) = (1,0,0) and 8 = multideg(g2) = (0, 1,0). 
Then y = (1,1,0) and «7 = lem(g1, g2) = zy. Now the S-polynomial is 

ry 

( 
= ryt yz—aryt+ xz 


S(g1, 92) = —(a + 2) y— 2) 


=IUzZ+ YZ. 


Now we use the division algorithm to determine if xz + yer is zero. 


—(xz + 27) 
yz—2? > (y-z) =z 
a2 =e) 

0 


The remainder on the division of S(g1, 92) by G is zero. Therefore, by Buch- 


berger’s Criterion, we verify that G is a Groebner basis for the ideal. 


This result may lead us to believe that a Groebner basis of an ideal is simply 
the set of polynomials that generate that ideal. The following example demonstrates that 


is not the case. 
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Example 16. Consider the ideal J = (—a?+ y,—a#? +z) and use lex ordering x > y > z. 
We will show that F = {—2x?+y, —23+z} is not a Groebner basis. We begin by computing 
the S-polynomial S(fi, f2). Let a = multideg(fi) = (2,0,0) and 8 = multideg(fo) = 
(3,0,0). Then y = (3,0,0) and 27 = lem(f;, f2) = 23. Now the S-polynomial is 


3 3 
S(fts f2) = —a(-2? +9) — —a(-a* +2) 
= —2(—a? + y) + (—2? + 2) 


=a—ay—x+z2 


=—-ry+ Zz. 


————_F 
Now we use the division algorithm to determine if (—xy+ z) is zero. 


—ry +z 


The remainder on division of S(f1, fo) by F' is not zero so we conclude that, with the 


monomial ordering specified, F' is not a Groebner basis for the ideal. 


If F had been a Groebner basis, we know changing the order of the divisors 
would not produce a different remainder. However, changing the monomial ordering on 


the set F can produce a different remainder as shown in the next example. 


Example 17. Using the same ideal and generating set as Example 16 but specifying lex 
ordering y > z > 2, we have G = {y — 27, z — x°} with a = (0,1,0) and 8 = (0,0,1). 
Then y = (0,1, 1) and «7 = yz. Now the $-polynomial is 


Now using the division algorithm, 
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(ya* — 20°) : (y—a?) = 2° 
— (yx? — 2°) 
—zx?’+2° : (g-2°) =-2? 
—(—za? + 2°) 
0 


—3 5G : . , : 
Since yx? — zx? = 0, we conclude G is a Groebner basis for the ideal when using lex 


y > Zz > & ordering. 


Buchberger’s Criterion allows us to check a proposed generating set to determine 
if it is a Groebner basis. What shall we do if it is not? Fortunately, we need not guess at 
random. The bi-conditional nature of the criterion suggests a practical way to construct 


a Groebner basis. 


3.5 Buchberger’s Algorithm 


Although we showed that every nonzero ideal has a Groebner basis, we cannot 
simply take the generating set of an ideal and assume it is a Groebner basis. The divi- 
sion algorithm terminates when none of the LT of the divisors can divide the LT of the 
dividend. If we want to extend the process so the division algorithm terminates with a 
remainder of zero, we need additional divisors. Bruno Buchberger, an Austrian math- 
ematician, published an algorithm in 1960 that derives these additional divisors using 
division on S-polynomials. Buchberger invented the name Groebner bases in honor of his 


thesis advisor, Wolfgang Groebner. 


Theorem 3.22. (Buchberger’s Algorithm). [CLO15] Let I = (fi, fo,..., fs) 4 {0} 
be a polynomial ideal. Then a Groebner basis for I can be constructed in a finite number 
of steps by the following algorithm: 
Begin with the set of polynomials defining the ideal, F = { fi, fo,...,fs} and specify a 
monomial ordering. Then, 

1) Compute the S-polynomial S(f;, f;) for each pair { fi, fj} in Fy iF j. 

2) Divide each nonzero S-polynomial by the generating set F’. 

3) If the remainder SG) #0, then adjoin Shafi) to the set F' so 
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BS PIT SCE, fi) }- Whenever S = 0 there is nothing new to add to the basis. 
Repeat steps 1 - 3 until the set F! = F. 


Buchberger’s Algorithm is demonstrated in the next example. 


Example 18. Consider the ideal I = (fi, fo) in k[x,y] where f, = a?y? + xy and fo = 
y' —y?. Using lex ordering with x > y, we compute a Groebner basis using Buchberger’s 
Algorithm. To compute the S-polynomial S(fi, f2), we have a = (2,2), 8 = (0,4) so 
that 2? = x?y*. Now, 


2,4 2,4 
gry xy 
S(fi, fa) = ppey +ry) = ae —y’) 


ay! + ay? — o2yt + ay? 
= gy? + ny. 


Next we divide, 


(x?y? + ay?) : (2’y?+ay) =1 


—(a?y? + cy) 
ay’—ary : (y*—y’?) =0 
20 


—_——__F 

sy®—ay—> S(fi, fr) 

Since S(f1, ral # 0, we revise our basis to include this new polynomial. We'll call it 
fs and notice the remainder on division of $(f,, fo) by the revised set F’ = {f1, fo, fs} 
is zero. But we now need to compute S(fj, fs) and S(fo, fg) to continue the algorithm. 
First, for S(f1, f3) we have a = (2,2), 8 = (1,3). So 27 = xy? and 


2 
x 
S( fu, fa) = Sag (a?? + 2y) — (ay? — ay) 


ry? aie ry? _ ry? pia ay 
= ay + zy. 
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Now, we divide, 


ay+ay? : (yt-y*) =0 


a’yt+ay? : (zy3—zy) =0 


= ae 
ay+ay?— S(fi, fs) 


Since S(fi, fay # 0, we call it fy and add it to our basis. We notice the remainder 
on division of S(f1, fs) by F” = {f1, fo, fs, fa} is zero. We continue the algorithm with 
S(fo, f3) which gives us a = (0,4) and 6 = (1,3). So x7 = ay* and 


4 4 


LY 7 4 2 Ly 3 
S = x 
(fe, fs) yf gy) ae y" — ay) 
= xy? a zy” _ ry" ab xy” 


= 0. 


Since S( fo, f3) = 0, we have nothing new to add to the basis. We still need to compute 
S-polynomials pairing fy = «?y+2y? with each of f; = «?y?+<y, fo = y*—y?, and fs = 
xy? — xy. First, 

2,2 


2,9 
xy xy 
S(fi, fa) = pypey PLU) gy OY + ry”) 


= —axy? + xy. 


We notice that S(fi, f4) = —fs so S(fi, ag = 0 giving us nothing new to add to the 
basis. Proceeding with the the next S-polynomial, 


2,4 
S(fa, fa) = (yt -y*) -—2@? 


Now we divide S( fo, fa) by {fi, fo, fs, fa}, 


Gey ay) ee tag)) a1 
—(-2*y? — zy) 

—sy>+ay : (zy’-zy) = -y 
—(-ay? + zy’) 


—sy>+ay : (sy?—ay) =-1 
—(-ay? + ay) 
0 


—.—F 
Since S( fo, fa) = 0, there is nothing new to add to the basis. So we continue with 


xy? 2,40 
(ay 


S(f3, fa) 


= —2*y — zy’. 


Now we divide S(f3, fa) by {fi, fo, f3, fa}, 


EEyaty oe ya Sl 
—(-2*y — zy”) 

—sy*+ay? : (cy’-ay) =-y 
—(-ay* + zy’) 
0 
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=F : 
Since (f3, f4) = 0, the set is unchanged and there are no more S-polynomials to compute. 


Hence, Buchberger’s Algorithm renders the Groebner basis F = {x?y? +2y, y*—y?, cy? — 


xy, cy + cy} for I = (x?y? + xy, y* — y?) on lex ordering with x > y. 


Buchberger’s Algorithm may render a Groebner basis that has more polynomials 


than necessary. These redundant generators can be eliminated and the reduced set will 


still be a Groebner basis. 


Lemma 3.23. [CLO15] Let G be a Groebner basis of I C k[x1,22,...,X%n]. Let p € G be 
a polynomial such that LT (p) € (LT (G \{p}) ). Then G \{p} is also a Groebner basis 


for I. 


Proof. Since G is a Groebner basis of I, we have (LT(G)) = (LT(Z)). Also since LT(p) € 


( LT(G \{p})) and is a monomial, we know LT(p) is divisible by LT(g;) for some 
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gi © G \{p}. So, by removing LT(p) from the set of generators, we still have a Groebner 
basis for J. That is (LT(G \{p})) = (LT(G)) = (LT(J)). 


The Groebner basis G \{p} will contain a minimal number of elements if all 
polynomials p with LT(p) € (LT(G \{p})) are removed from G. We can further simplify 
the members g; of a Groebner basis by scaling by LOG: Below, we define a minimal 


Groebner basis. 


Definition 3.24. [CLO15] Let G = {g1,...,gs} be a Groebner basis of an ideal J C 
kla1,...,%pn]. Then G is a minimal Groebner basis if and only if for each 7 = 1,2,...,8, 


the polynomial g; is monic and its leading monomial LM(g;) does not divide LM(g;) for 
any 1 4 j. 
Example 19. We return to the ideal J studied in Example 18. I = (x?y? + xy, y* — y’). 


We produced a Groebner basis with four members, 


gn = ay? + xy 

g=y_¥-y 

93 = ty? — ry 

ga = ay + ary? 
The polynomials are all monic but we see that LT(g,) = y- LT(g4) so we can remove gj; 
from the basis to obtain a minimal Groebner basis. This definition considers only the 
lead terms of the Groebner basis but we know the polynomials are linear combinations of 


the monomials in (LT(/)) and so uniqueness of the minimal basis is not guaranteed. For 


uniqueness, we impose an additional restriction on the Groebner basis described below. 


Definition 3.25. [CLO15] A reduced Groebner basis for a polynomial ideal I is a Groeb- 
ner basis G for J such that: 


(i) LC(p) = 1 for allpeG. 
(ii) For all p € G, no monomial of p lies in (LT(G \{p})). 


So, a reduced Groebner basis verifies that the leading term of any polynomial 


in the basis does not divide any term of another polynomial in the basis. 


Example 20. We return once more to the ideal of Example 18 and the minimal Groebner 


basis identified in Example 19, G = {y* — y?, ry® — xy, x?y + xy?}. We can verify that 
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none of the leading terms y*, xy?, «?y divides any terms of the polynomials in G so we 
call G the reduced Groebner basis for I. 


The Ideal Membership Problem was previously mentioned. As we apply alge- 
braic techniques to proving geometric theorems, we will be concerned with determining 
if a polynomial is a member of an ideal. Suppose we are given two ideals in terms of their 
generators. We can determine if they are distinct ideals or the same ideal by examining 
the reduced Groebner basis. If the set of generators of the reduced Groebner basis for 
each ideal is the same, then the ideals they generate are the same. This fact alleviates 


ambiguity in our efforts. 
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Chapter 4 


Algebra-Geometry 


Correspondence 


Now that we have a method to make the study of ideals of polynomial rings 
more manageable, we return to the task of correlating ideals and varieties. This will 
enable the application of algebraic techniques to the proving of the geometric theorems 


to be presented in this thesis. 


4.1 Hilbert’s Nullstellenatz 


An important relationship between ideals of polynomial rings and varieties was 
discovered and proven by David Hilbert in 1900. Nullstellenatz is a German word mean- 
ing theorem of zeros and will help us determine which ideals correspond to varieties. 

In Section 2.3 we constructed a map from an affine variety V C k” to an 
ideal I(V) by defining I(V) to be the set of polynomials that vanish for each n-tuple 
in the variety. Conversely, if we have an ideal I C k[x1,2%2,...,2n], the variety of the 
ideal is the set of n-tuples that make all the polynomials in the ideal vanish. That 
is, V(I) = {(a1,@2,...,@n) € k” | f(a1,a2,...,an) = 0 for all f € I}. Furthermore, 
Proposition 3.16 tells us that V(J) is an affine variety so we have a map from ideals of 
polynomial rings to affine varieties. These two maps give us a correspondence between 
ideals and varieties. It is natural to ask if the maps are one-to-one. We will show in the 
next example that different ideals can give the same variety so the map J + V(J) is not 


one-to one. 


Example 21. Consider the ideals J = (x) and J = (a7) € kia]. Since x ¢ (x7), 


37 


(x) # (x?). However, V(I) = V(J) = {0}. 


In the discussion following Example 2, we noted that a variety with more than 
one polynomial may be empty and that in R? the graphical representation would be 
curves with no points in common. Consider also V(y — x2 — 1) € R?. The solutions to 
y — 2? —1 = 0 are (2,0) and (—i,0) but we would say V(y — 2? — 1) = @ since these 
solutions are not in R?. In fact, the variety of any ideal generated by a polynomial with 
real coefficients that has no real roots will be empty. Our dilemma stems from the fact 
that we are not working in an algebraically closed field. In this section, we will study how 
working in an algebraically closed field eliminates the problem of having different ideals 
represent the empty variety. When working in an algebraically closed field, the only ideal 
whose corresponding variety is empty is the ideal that generates the entire ring. We begin 


with a lemma dealing with the one variable case when the ring is k[2]. 


Lemma 4.1. [CLO15] Let k be an algebraically closed field and let I C k[{x] be an ideal. 
Then V(1) = 0 if and only if I = k{z]. 


Proof. Since k[z] is a principal ideal domain, any ideal in k[z] can be generated by a 
single polynomial. That is, J = (f) for some polynomial f € k|a]. The variety V(J) of 
the ideal is the set of a € k such that f(a) = 0. We are working with an algebraically 
closed field so every polynomial, except the constant polynomials, has a solution a for 
f(a) =0. These solutions are called roots of the polynomial. Therefore, V(I) 4 0 unless 
f is a nonzero constant. In the case where f is a nonzero constant, its multiplicative 
inverse, g = ; is also in k and as a result, 1 =g-f ¢ J. This means p = p-1 € I for 
all p € k{x]. In other words, I = k[x]. Since every nonzero constant polynomial in the 
ring is also in the ideal and we know that a nonzero constant polynomial has no roots, 
the collection of polynomials in the ideal will not have a common root. So, when k is 
algebraically closed I = k[z], the ideal that represents the entire ring, is the only ideal 
that yields V(I) = 0. 


This property also applies to rings of polynomials in more than one variable. 
As long as the coefficients come from an algebraically closed field, the only ideal of 
k[x1,22,...,%p] that yields an empty variety is the ideal that represents the entire ring. 


This result is precisely stated in the next theorem. 


Theorem 4.2. (The Weak Nullstellenatz). [CLO15] Let k be an algebraically closed 
field and let I C kia, %2,...,%n] be an ideal. Then V(I) = Q if and only if I = 


hla a9 on ay |. 
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The proof uses induction on the number of variables and uses the lemma above 
as the base case. An important element of the proof is that 1 € J if and only if J = 
Klaiystoj++52n|s TfL 6 kleijee; ci, then hk = je 1-€ TF for all he kai, 95.2.5. xa|, 
It follows that I = k[a1,20,...,2,]. On the other hand, if J = kiaj,22,...,@], then 1 
€ k[x1,22,...,%y] = I. The remaining details of the proof are given on pages 177 - 178 
of [CLO15]. 

To prove the geometric theorems in this thesis, we will need to show that the 
variety on the hypotheses and conclusions of the theorems is not empty. The polynomials 
that represent the hypotheses and conclusions will need to have a common zero. The Weak 
Nullstellensatz tells us that a variety V(f1, fo,..-, fs) = 0 ifand only if1 € (fi, fo,..., fs). 
We saw in the last chapter that uniqueness is a feature of a reduced Groebner basis for 
an ideal. So we can observe that for any monomial ordering, {1} is the only reduced 
Groebner basis of the ideal (1) = k[x1, r2,..., Up]. 

Although the Weak Nullstellensatz helps us determine if a system of polynomials 
with coefficients in an algebraically closed field has a solution, it does not guarantee that 
if there is a solution, that solution results from a unique ideal. Consider again the ideals 
given in Example 21. We saw that (x) 4 (x?) but V(x) = V(a?) = {0}. Now0€C 
and the field of complex numbers is algebraically closed. Hence, the map I > V(J) is 
not one-to one despite working with an algebraically closed field. We can expand on this 
example to consider multivariable polynomials to better understand why the map is not 
one-to-one. Consider the ideals (a, y) and (x?,y). Since (x,y) ¢ (27, y), (x,y) # (x?, y). 
However, the variety on both these ideals is {(0,0)} € C?. In fact, if we consider any ideal 
of the form (x, y") where m and n are integers greater than one, the variety will be the 
same ordered pair. Thus a reason that the map fails to be one-to-one is that a power of a 
polynomial vanishes at the same point(s) as the original polynomial. The next theorem 
states that over an algebraically closed field, this is the only reason that the map fails to 
be one-to-one. If a polynomial f vanishes at all the points of some variety V(J/) then the 


ideal must contain some power of the polynomial f. 


Theorem 4.3. (Hilbert’s Nullstellensatz). [CLO15] Let k be an algebraically closed 
field. If fi, fa, cei, fs € k[x1, TQ,++- : Zn]; then x € I(V(fi, fa, ee a fs)) of and only if 


re (Fei Jos.ckia fa) 


for some m> 1. 
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Plan for proof. The proof of this theorem is not as straightforward as some other proofs 
we have examined. As such, we pause to engage in a problem solving strategy to help 
give some direction to the proof. We consider what we are given and what we want to 
show. We are given that a function is a member of an ideal. Specifically, the function 
is a member of an ideal of a variety. This means the function has a common zero with 
the other functions in the variety. We want to show that some power of the function is 
in the ideal generated by the functions that are in the variety. To prove this theorem 
we will need to introduce a power of the function f. Strategies learned in secondary 
algebra include raising both sides of an equation to a power and multiplying both sides 
of an equation by the desired power of f. To use either of these strategies, we need an 
equation. We know that if f € J we can write f as a linear combination of the functions 
in the ideal and the functions in the ring. We also must consider that we are working 
with the ideal of a variety and for f to be in the ideal, the variety must not be empty. We 
could attempt to find a solution, an n-tuple in the variety, but this may be a cumbersome 
process. Reflecting on the previous result, the Weak Nullstellensatz, brings to mind that 
it may be easier to prove a variety is empty. Clearly, if f has a common zero with the 
functions in the variety, 1 — f will not have a common zero with those functions. If we 
introduce a new ideal adjoining the original functions with 1 — f, we could show that 
the variety on the ideal is empty. The advantage here is that we could then express 1 
as a linear combination of f and the original functions. So far, we have a plan to show 
the variety is not empty but we have not found a way to introduce a power of f. We 
noted that we could multiply both sides of an equation by a power of f and this seems 
like a good approach since we can write an equation with 1 on one side of the equation. 
However, the other side of the equation would not give us what we desire. We need to 
have a linear combination that does not include f. Again we turn to secondary algebra 
and recall that multiplying by the reciprocal allows us to “cancel” an algebraic object. 
This leads us to introducing an extra variable that will be the multiplicative inverse of f. 

To simplify the notation, a proof for the polynomial ring in two variables will 


be shown. The general proof is given on pages 179 - 180 in [CLO15]. 


Proof. For an algebraically closed field k, if f1, fo,...,fs € k[a1,v2], we want to show 
that given a nonzero polynomial f that vanishes at every ordered pair (a1, a2) for which 
fi, fa,..-,fs vanish there exists an integer m > 1 such that f™ € (fi, fo,..., fs). For 
f™ to be in the ideal, there must be polynomials Aj, A2,...,As € k[x1, 72] so that f™ 


is a linear combination of A; and f; for 1 <i<_s. As described in the plan above, we 
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introduce a third variable x3 and a new ideal I = (f1, fo,...,fs,1—a3f) C k[x1, x2, x3]. 
We will show I = k[z1,x2,x3]. When J is the entire ring, the variety V(J) is empty. 
Let (a1, a2,a3) € k®. There are two possibilities to consider. Either (a1, a2) is a common 
zero of fi, fo,..., fs or it is not. We will show that in either case, V(I) = 0. First, 
if (a1, a2) is not a common zero of fi, fo,..., fs there is some f; with 1 <i < s, such 
that f;(a1,a2) # 0. Evaluating f; at (a1, a2,a3) does not change this result and having 
fi(a1,a2,a3) # 0 tells us (a, a2,a3) ¢ V(I). Hence, in this case, V(I) = 0. 

Now, if (a1, a2) is acommon zero of the polynomials fi, fo,..., fs and we assume 
f €1(V(ft, fo,..-, fs)), then f(a1,a2) = 0. For this case, we will show V(I) = 0 by con- 
tradiction. Suppose (a1, 42,43) € V(L). Since 1—2x3f € I, we expect 1—23f (a1, a2, a3) = 
0. However, 1 — #3 f(a1,a2,a3) = 1— 23-0 = 1. Hence, (a1, a2,a3) ¢ V(L) and we con- 
clude V(J) = 0. Having V(I) = 0 allows us to apply the Weak Nullstellensatz to state 
1 © I. Thus, we can write 1 as a linear combination of polynomials in J and polynomials 
P1,P2,---,Ps,p in the ring k[x1, x2, x3] as follows: 
1 = pi (%1, 22, €3) fi (21, 22) + po(v1, Xa, 3) fo(1, 22) +--+ + ps(@1, F2, 3) fs(L1, 22) + 

p(@1, 2, 03)(1 — x3 f (x1, x2)). 

Recalling from the plan for our proof that the purpose of introducing a third 


variable was to have the multiplicative inverse of our function, we let 73 = j and 


_ 1 
f(x1,x2 
our equation is 


1 = pi(x1, #2, +) fi(v1, 22) + po(w1, 2, )fo(a1,@2) +++ + ps (21, 2, 7) fs(e1,@2)+ 
p(x1, £2, +)(1 _ + - f (a1, @2)). Since 1 — ; - f (a1, 22) = 0, we have 


1 = pi(21, 22, +) fi (x1, 2) + po(x1, £2, +) f2(#1, £2) seth Dal Fis 05 +) fe(a1, 22). 

Now we can multiply both sides of the equation by f™, where m is a positive 
integer large enough to clear all the denominators on the right side of the equation which 
gives us the desired result, 


f™ =Aifit Agfo+-:-+Asfs for polynomials A; € k[x1, x9]. 


The Nullstellensatz helps us express geometric objects with the precise language 
of algebra. The symbolic and numeric manipulation of algebraic objects give practical 


tools for applications such as the proofs included in this thesis. 


4.2 Radical Ideals 


As we continue to explore the connection between varieties and ideals, we con- 


sider the kinds of ideals that can be recognized as the ideal of a variety. In other words, 


Al 


we want to identify those ideals that contain all the polynomials that vanish on some 


variety V. The following lemma reveals an important feature of such ideals. 
Lemma 4.4. [CLO15] Let V be a variety. If f™ € I(V), then f € I(V). 


Proof. Let V be a variety and a be an arbitrary n-tuple in V. We are assuming f” € I(V). 
This means that (f(a))™ = 0. But (f(a))™ = 0 only when f(a) = 0. Since a is an 
arbitrary n-tuple, then f vanishes for alla € I(V). This means f € I(V). 


This lemma tells us that an ideal consisting of all the polynomials that vanish on 
a particular variety V has the property that if some power of a polynomial is a member of 
the ideal, then the polynomial itself is a member of the ideal. We call such ideals radical 


ideals and provide the definition below. 


Definition 4.5. [CLO15] An ideal I is a radical ideal if f™ € I for some integer m > 1 
implies that f € J. 


This definition allows us to restate Lemma 4.4 to classify ideals of varieties as 


radical ideals. More precisely, for a given variety V, I(V) is a radical ideal. 


Definition 4.6. [CLO15] Let J C k{x1,22,...,2n] be an ideal. The radical of I, denoted 
VI is the set {f | f™ € I for some integer m > 1}. 


Lemma 4.7. [CLO15] Jf I is an ideal in k[a1,x2,...,%n] then V/I is an ideal in 


k[x1,%2,...,%n] containing I. Furthermore VI is a radical ideal. 


Proof. To show that J C VI, consider any polynomial f € I. Then f! € I and by 
Definition 4.6, f € VI. Hence, I C VI. 

To verify that VJ is an ideal, we need to show that VI satisfies the three 
conditions of Definition 2.9. Since J is an ideal, 0 € J and since J C VI ,0E€E VI. Now 
suppose f,g € VI. Then by Definition 4.6, there exist positive integers m and n such 
that f™,g" € I. We need to show that f +g € VI. Consider the binomial expansion 
(f +g)™*"-!. From secondary algebra, we have that each term of the expansion has a 
factor f'g’ with i+ j= m-+n-—1. It follows that either i > m or j > n so that either 
f* or 9 is in I. This means fg! € I so that each term of the expansion is in J. Hence, 
(f + g)™*"-! € I and by Definition 4.6, f +g € VI. To satisfy the third condition, 
suppose f € VI and h € k[x1,29,...,%n]. Then f™ € I and h™ € k[x1,29,..., 2p] for 
some integer m > 1. We need to show hf € VI. Since I is an ideal, (hf)™ = h™f™ € I. 
Hence, hf € VI. Since all conditions are satisfied, V7 is an ideal. 
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Finally, to show VJ is a radical ideal, suppose f” € VI. Then by Definition 
4.6, there is some integer n > 1 such that (f”)" = f” € I. This implies f € VT since 
mn > 1. Hence, VJ is radical. 


With the introduction of radical ideals, the relationship between algebraic and 
geometric concepts can be more easily recognized in another form of Hilbert’s Nullstel- 


lensatz. 


Theorem 4.8. (The Strong Nullstellensatz). [CLO15] Let k be an algebraically closed 
field. If I is an ideal in k[a1,x2,...,2n] , then 


Proof. Let k be an algebraically closed field and I C k[x,,x2,...,@p] be an ideal. We will 
show I(V(I)) = VI by double inclusion. First, let f € VI so that f” € I for some integer 
m > 1. This means that f™” vanishes on V(J). In other words, f’ (a1, a2,...,@n) = 0 for 
all (a1, a2,...,@n) € V(J). But f™(a1,a2,...,an) = (f(a1,a2,...,an))” which equals 
zero only when f(a1,@2,...,@n) =0. In other words, f vanishes on V(I) so f € I(V(J)) 
and VI CI(V(J)). 

Conversely, let f € I(V(Z)) so that f (a1, a2,...,@n) = 0 for all (a1, a2,...,an) € 
V(J). Since k is an algebraically closed field, we can apply Theorem 4.3 and conclude 
there exists some integer m > 1 such that f™ € I. Hence, f € VI and I(V(J)) C VI. 


Now we re-examine the maps between ideals and varieties. 


Theorem 4.9. (The Ideal-Variety Correspondence). [CLO15] Let k be an arbitrary 
field. 


(i) The maps 
affine varieties 4, ideals 
and 
: Vv eer 
ideals —> affine varieties 


are inclusion-reversing, t.e., if I, C Ig are ideals, then V(I,) D> V(I2) and, similarly, 
if Vi C V2 are varieties, then I(V,) D I(V2). 


(ii) For any variety V, 
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(iii) If k ts algebraically closed, and we restrict to radical ideals, then the maps 
affine varieties > radical ideals 
and 
radical ideals ~> affine varieties 


are inclusion reversing bijections which are inverses of each other. 


Proof. (i) First, we will show that the map T is inclusion reversing. That is, for varieties Vi 
and V2, we will show Vj C V2 implies I(V2) C 1(Vi). Let f € I(V2). Then f(ai,...,an) =0 
for all n-tuples (a1, @2,...,@n) € Va. Since Vi C Vo, f(ai,a2,.--,@n) = 0 for all n-tuples 
in V;. Hence, f € 1(V,) and I(V2) C 1M). 

Next we will show that the map V is inclusion reversing. For ideals I, and Ig we 
will show I, C Ig implies V(Iz) C V(J,). Consider the arbitrary n-tuple (a1, a@2,...,@n) € 
V(2). Then f(a1,a2,...,@,) = 0 for all polynomials f € Ig. Since I; C Ig, the polyno- 
mials in J; are included in this collection of polynomials that vanish on (a1, a@2,...,@n). 
Hence, (a1, @2,...,@n) € W(1) and V(J2) C V(hh). 

(ii) First we show that the map I is one-to-one by double inclusion. That is, we 
will show for any variety V, V(I(V)) = V. Let V = V(fi, fo,..., fs) be an affine variety 
and let (a1,a2,...,@n) € V. Then for all f € I(V), f(ai,a2,...,an) = 0. This means 
the n-tuple (a1, @2,...,@n) € V(I(V)) and V C V(I(V)). To show the other inclusion, 
we note that each of the polynomials f1, fo,..., fs € I(V) since V contains the n-tuples 
that make these polynomials vanish. Consequently, the ideal (fi, fo,..., fs) C 1(V). In 
part (i), we showed that the map V is inclusion reversing so when we apply the map to 
these ideals, we have V(I(V)) C V((fi, fo,---,fs)) =V. Hence, V(I(V)) = V and T is 
one-to-one. 

We will again use double inclusion to show V(VI) = V(I). We showed in the 
proof of Lemma. 4.7 that I C VI and by part (i) above we know the map V is inclusion 


44 


reversing so we have V(VI) C V(J). For the other inclusion, consider an arbitrary n- 
tuple a € V(Z) and an arbitrary polynomial f € VI. Then f™ € I for some m > 1. This 
means f(a) = (f(a))™ = 0 which implies f(a) = 0. In other words, a € V(VI) which 
means V(I) C V(VI). Hence, V(VI) = V(J). 

(iii) We want to show that when k is an algebraically closed field and I is a 
radical ideal, V(I(V)) = V and I(V(Z)) = J. With Lemma 4.7 and Definition 4.5, we 
concluded that I(V) is a radical ideal and we showed in part (ii) that V(I(V)) =V. We 
know from the Strong Nullstellensatz I(V(I)) = VI so we need to show that when J is 
radical, WI = I. We will use double inclusion to show the equality. Suppose J is radical. 
We showed in the proof of Lemma 4.7 that I C VI. To show the opposite inclusion, 
suppose that f € VI. By Definition 4.5, there exists an integer m > 1 such that f™ € I 
and since I is radical, f € I. Thus, /IJ C I. Consequently, when J is radical, /I = I 
and by the Strong Nullstellensatz, for a radical ideal J, I(V(I)) = I. Hence the maps I 


and V are inverses of each other and define bijections between radical ideals and affine 


varieties. 


This result is pivotal in our effort to use algebraic methods to prove geometric 
theorems. We can now discuss varieties and radical ideals interchangeably when working 
in an algebraically closed field. We will express the hypotheses and conclusion(s) as 
polynomials. Then to determine if a conclusion polynomial vanishes on the same variety 
as the hypotheses, we will check to see if the conclusion polynomial is in the radical ideal 
generated by the set of hypothesis polynomials. The strategy used to prove Hilbert’s 
Nullstellensatz is instrumental in checking for such membership as described in the next 


proposition. 


Proposition 4.10. (Radical Membership). [CLO15] Let k be an arbitrary field and 
let I = (fi,..., fs) C k[a1,..., 2p] be an ideal. Then g € VI if and only if the constant 
polynomial 1 belongs to the ideal I = (f,,..., fs,.1—yg) C k[x1,...,¢n,y], in which case 
T=kla1,...,2n,4]- 


Proof. Suppose g € VI. By definition, g™ € I for some integer m > 1. Since I C I, we 
have g” € I. We also have 1— yg € I. We want to show the polynomial 1 is a member of 
I. Notice that 1 = y™g™ + (1—y'g™) and by factoring the binomial 1 — y'g’™ we have 


L=y"g"+(1—yg)\ltygty?g?t---+y™ 191). 
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We already know y'"g™ € I since g™” € I and y™ is in the ring k[z1,22,...,@n,y]. Also, 
since 1 — yg € J and (l+ygty?g?+-:>+ty™ 1g") € k[x1,22,...,2n,y] we conclude 
that 1— yg’ € I. Hence, by closure, 1 € I. 

Conversely, suppose 1 € I. The proof of Hilbert’s Nullstellensatz (Theorem 
4.3) demonstrates that when 1 € I, g™ can be expressed as a linear combination of the 


polynomials in J and polynomials in the ring k[x,,x2,...,2,]. This means g” € I and 


ge VI. 


4.3. Operations on Ideals 


Many natural operations on varieties correspond to natural operations on their 
ideals. Since ideals are algebraic objects, operations on ideals lend themselves to algo- 
rithms that can be efficiently completed with the aid of technology. This allows us to use 
the convenience of computer programs to work with varieties. Our primary objective in 
the geometric theorems to be presented in the next chapter is to determine if a conclu- 
sion polynomial vanishes on the same variety as the collection of hypothesis polynomials. 
The algorithm to determine radical membership plays a crucial role in our ability to use 
technology to complete this task. 

In this section we define sums and products of ideals and show that these are 


also ideals. Then we show how these operations correspond to operations on varieties. 


Definition 4.11. [CLO15] If J and J are ideals of the ring k[x,,x2,...,2»], then the 
sum of I and J, denoted I + J, is the set 


I+J={ft+g|felandge J}. 


Proposition 4.12. [CLO15] If I and J are ideals in kx, x2,...,%n], then I+ J is also 


an ideal in. Kit; 050025. Vl: 


Proof. To verify that I + J is an ideal, we need to show that J + J satisfies the three 
conditions of Definition 2.9. Since J and J are ideals, 0 € J and 0 € J so that O=0+0€ 
I+ J. Now suppose h,,hg € 1+ J. Then there exist f1, fo € J and gi, 92 € J such that 
hi = fit+gi and he = fo + g2. Now hi + ho = fit git fo+g2 = (fi + fa) + (91 + 92). 
Since J and J are ideals, f; + fo € J and gi + go € J and by the definition of J + J, 
hy+h2 € I+ J. To satisfy the third condition, suppose h € [+ J and p € k[x1, 22,..., Xp]. 
Then, there exist f € I and g € J such that h = f+g. So,p-h=p-(f+g)=p-ft+p-g 
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Since I and J are ideals, p- f € I and p-g € J and by the definition of [+ J, p-heI+J. 


Since all conditions are satisfied, J + J is an ideal. 


Proposition 4.13. [CLO15] Let I = (fi, fo,..., f-) and J = (91, 92,---,9s). ThenI+J 
is the smallest ideal containing I and J. Furthermore, I+ J = (fi,.--, frs G1y+++59s)- 


Proof. Suppose H is an ideal that contains J and J. Then H must contain all polynomials 
f € J and all polynomials g € J. Since A is an ideal, all sums f + g € H. In particular, 
I+ JCH. Therefore, every ideal containing J and J contains J+ J and hence, J+ J 
must be the smallest ideal containing J and J. 

We use double inclusion to show I+ J = (fi,..., fr, gi,---;9s). Since I and J 
are ideals, 0 € J and 0 € J so that (f, +0, fo+0,...,f-+0, O+91,0+g2,...,0+ 95) = 
(fi, fos ++ +5 Firs G15 92)+-+59s) CI+Jd. Now (fi,..., fr, 91,---59s) is an ideal that contains 
Pond J se that Ppa forges ies Dus. PF = Chis cian Fen Qigcety Gs) 


We can generalize the equality of Proposition 4.13 as the following corollary. 
Corollary 4.14. [CLO15] Jf fi, fo,..., fr © klx1, v2,---,2n], then 


(His faye soa dr) = Si) + fap Pee eee 


Next we consider the nature of the variety on the sum of ideals. Since the sum of 
two ideals must contain both ideals, the variety on the sum must include all the n-tuples 
where all polynomials in both ideals vanish. This collection is the intersection of the 


varieties of the two ideals as stated in the next theorem. 
Theorem 4.15. [CLO15] Jf I and J are ideals in k[x1,x2,...,Xn], then 
ViI+ J) = VI) VJ). 


Proof. We use double inclusion to prove the equality. First, suppose the arbitrary n-tuple 
a = (a@1,42,...,@n) isin V(I+ J). Then a € V(J) since J C I+ J. Similarly, since 
JCI+J,a€ ViJ). Hence, a€ ViI)M ViJ). Consequently, VI + J) C ViI)N VJ). 

Now, suppose a € V(I)N V(J). Then a € V(I) and a € V(J) so for all 
f € 1, f(a) = 0 and for all g € J, g(a) = 0. Let h be any polynomial in J+ J. Then 
there exist f € I and g € J such that h = f +g. Thus, h(a) = f(a) + g(a) =0+0=0. 
Hence, a € V(I+J). Consequently, ViI)N V(iJ) C ViI+J) and we conclude V([+J) = 
VIA VJ). 


AT 


Now we explore the product of ideals and show that they correspond to the 


union of varieties. 


Definition 4.16. [CLO15] If J and J are ideals of the ring k[a1,x2,...,%n], then the 
product of I and J, denoted I - J, is the set 


I-J={fig. +--+ fror | fi,.--,fr € I and gi,...,gr € J, r a positive integer}. 


Proposition 4.17. If I and J are ideals in k{x1, x2,...,%n| then I- J is also an ideal in 


Felis 895 . we ate | 


Proof. To verify that I- J is an ideal, we need to show that I - J satisfies the three 
conditions of Definition 2.9. Since J and J are ideals, 0 € J andOeE J. SOO =0-0€ 
I-J since we have a product of an element of J and an element of J. Now suppose 
hy, ho € T- J. Then hy = figi + fogg +---+ frg,y where f; € J andg; € J, 1<i<r. 
Also, ho = fig + fo9o +--+ + fog, where f; € I and gi ¢ J, 1 <j < s. Now the sum 


hy the = figit fogot::-+frort figs + fo95+--- +f) is comprised of terms that are the 
product of an element of J and an element of J. Thus, hy +hg € I-J. To satisfy the third 
condition, suppose h € I- J and p € k[x1,22,...,%n]. Then h = fig: + fog2 +--+ + fr gr 
where f; € J and g; € J, 1 <i<r. So, the product 


p-h=p(figi + fog2 +++: + frgr) 
= pfigi + pfoge +++ + DhrGr 
= (pfi)gi + (pf2)g2 +--+ (Pfr)gr- 


For 1 <i <r, each product pf; € I, since p € kia1,x2,...,@,] and f; € I. Thus, each 


term of the product p-h is a product of an element from J and an element from J so 


that p-h € I- J. Since all conditions are satisfied, J - J is an ideal. 


The product of the ideals J and J is not simply the set of products with one 
factor an element from J and the other factor an element from J. If we defined the product 
this way, we would not have a set that was closed under addition and the product I. J 
would not be an ideal. However, we can express the product J-J in terms of its generators 


when we are given the generators for both the ideals J and J. 


Proposition 4.18. [CLO15] Let I = (fi, fo,..., fr) and J = (91, 92,.--,g9s). Then I- J 
is generated by the set of all products of generators of I and J: 


IJ = (fig) |1S iS 1, 15 <8). 
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Proof. We use double inclusion to prove the equality. First, suppose h € (fig; | 1<i< 
ti a7 <sy, Therh= SS pig fig where pj; € k[t1,%2,...,%n]. Since each fj is a 
member of the ideal J and each p;; is a member of the ring, then each p;; fj € I. Also, 
each gj; € J so that each term of the sum is a product of an element from J and an 
element from J which makes the sum an element of J - J. Consequently, (fig; | 1 <i< 
r, 1<j<s)  CI-J. Now to show the opposite inclusion, we recall that each polynomial 
in J- J is asum of polynomials of the form f-g with f ¢ J and g € J. But we can express 
f in terms of the generators fj, fo,..., f;, and g in terms of the generators g1, 92,..-, 9s: 


For polynomials aj, a2,...,@,,b1, b2,...,6s € k[x1,v2,...,2n], we have 
f=afitarfot---+arf, and g = big; + bog2+---+ dsgs. 


Thus, we can write products of the form f -g as 


f-9 = (ahi t+ aofet-++ + arfr)(bigi + b2g2 +--+ + bsgs) 
= aifibign + ai fibeg2 +--+ + ai fibsgs + a2febigi + a2 fob292 
+++ + dofobsgs +--+ + ar frbsgs 
= (a1b1) figi + (a1b2) figa +--+ + (arbs) figs + (a2b1) fogi + (a2b2) foge 
+--+ + (dads) fags +++ + (arbs) frgs- 


Notice that for 1 <i<rand1< j < 8, ajbj € k[x1,22,..., 2%] so that the product 
f -g is in the ideal generated by the polynomials fjg; for 1 <71<rand1l<j<s. 
That is, products of the form f-g € (figj | 1 <i< 7, 1 < j < s). Consequently, 
Le Pogy | la St, bes). Thos, Pd =(fg;| Tava ley): 


Next we consider the nature of the variety on the product of ideals. The next 
theorem states that the variety on the product of ideals corresponds to the union of the 


varieties. 
Theorem 4.19. [CLO15] Jf I and J are ideals in k[x1,x2,...,Xn], then 
V(I- J) =V(DUV(J). 


Proof. We use double inclusion to prove the equality. First, suppose a = (a1, d2,...,@n) 
is an arbitrary n-tuple in V(J- J). Then for all f € J and for all g € J, f(a)g(a) = 0. 
We consider two possibilities, either f(a) = 0 for all f € J or f(a) £0 for some f € J. If 
f(a) = 0 for all f € J, then a € V(J). If f(a) 4 0, for some f € J, then we must have 
g(a) = 0 for all g € J, in which case a € V(J). In either case, a € V(I) U V(J) and 
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V(I- J) C VI)UV(J). Conversely, suppose a € V(I) UV(J). Then either f(a) = 0 for 
all f € I or g(a) = 0 for all g € J. Thus, f(a)g(a) = 0 for all f € J and for all g € J. So, 
for allh € I- J, h(a) =0. This means a € V(I- J) and V(I) UV(J) C VU: J). Thus, 
V(I- J) =V(DUV(J). 


4.4 Irreducible Varieties and Prime Ideals 


We continue to explore the relationship between algebraic and geometric objects 
by considering one of the most fundamental topics studied in secondary algebra: how to 
know if an algebraic expression is in simplest form and if it is not, how to simplify it. 
This notion of simplifying is learned early in mathematics education. Children in pri- 
mary school learning to make sense of numbers and quantities practice with money and 
are taught the simplest way to make change. They group types of coins and denomina- 
tions of dollars so that they manage the fewest number of items. Students learn how to 
determine if fractions are in simplest form by checking for common factors in the numer- 
ator and denominator. This is extended to rational expressions where the numerators 
and denominators are polynomials that may need to be factored. 

Simplifying algebraic objects is a familiar concept. In secondary geometry, we 
describe points, lines and planes as “undefined” objects but we need to determine what 
it means to simplify geometric objects. We need to know how to recognize when a 
geometric object is in simplest form. Specifically, we need to know how to recognize an 
irreducible variety. We begin this section with a definition and then take advantage of 


the correspondence between varieties and ideals to aid in this endeavor. 


Definition 4.20. [CLO15] An affine variety V C k” is irreducible if whenever V is 
written in the form V = V; UV2, where V; and V2 are affine varieties, then either Vj = V 
or Wy =V. 


We notice that this definition depends on a variety being expressed as a union of 
two varieties. We showed in Section 2.2 that the finite union of affine varieties is an affine 
variety but we still do not have a clear method for decomposing a variety and proving 
that a variety is irreducible. Working with ideals allows us to take advantage of algebraic 


operations so we next explore which types of ideals correspond to irreducible varieties. 


Definition 4.21. [CLO15] An ideal I C klay,22,...,@n] is prime if whenever f, g € 
klaj,22,...,%p] and fg € I then either fe Torgel. 
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Proposition 4.22. [CLO15] Let V C k” be an affine variety. Then V is irreducible if 
and only if I(V) is a prime ideal. 


Proof. First, assume V is an irreducible variety and let fg € I(V). We will show that 
I(V) is a prime ideal. Let Vi = VON V(f) and V2 = VM V(g). We showed in Section 
2.2 that the intersection of two affine varieties is an affine variety so V; and V2 are affine 
varieties. Since fg € I(V), we can write V = Vj UV2. Since V is irreducible, either 
V = Vj or V = Vy. Considering the first possibility, we would have V = V; = VO V(f) 
which implies f vanishes on V. This means f € I(V). If instead, V = Vo, we have 
V =VNV(qg) which implies g vanishes on V and g € I(V). In either case, the definition 
of a prime ideal is satisfied. 

Conversely, assume I(V) is a prime ideal and let V = Vj UV. We will show 
the V is an irreducible variety. Without loss of generality, suppose V # V;. We will 
use double inclusion to show I(V) = I(V2) so that we can conclude V = V2 and satisfy 
the definition of an irreducible variety. Since V = Vj U Vo, it must be true that Vo C V 
and by the Ideal-Variety Correspondence in Section 4.2, I(V) C I(V2). For the opposite 
inclusion, we recall V = V; UV, but that V #4 V;. Hence, it must be true that Vj C V 
and by the Ideal-Variety Correspondence, I(V) ¢ I(V,). This means there must be some 
polynomial f such that f € I1(V;) but f ¢ I1(V). Now let g be any polynomial in I(V2) 
so that g vanishes on V3. As a result, fg vanishes on V2 and since V = Vj U Vo, we 
can conclude fg vanishes on V. This means fg € I(V). Our assumption is that I(V) 
is prime and according to the definition, either f € I(V) or g € I(V). But we chose a 
polynomial f that was not in I(V) which means g € I(V). Hence, I(V2) Cc I(V). This 
proves I(V) = I(V2) and since the map I is one-to-one, V = V2. We have shown that 
ifV =V,UV, and V ¥ Vj, then V = V2 so we can conclude that V is an irreducible 


variety. 


We now have a way to verify that a variety is irreducible by examining the 
corresponding ideal. In the examples presented in this thesis, we will work with radical 


ideals so the following result will make the approach more direct. 
Proposition 4.23. Every prime ideal is radical. 


Proof. Let I be a prime ideal and assume f™ € I for some m > 1. We want to show that 
f © I which would make J a radical ideal. Suppose, on the contrary, that f’ © I but 
m >1so that f ¢ I. Pick the smallest such m. We have f - f”~! = f™ € I and since I 
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is a prime ideal, either f € I or f™ 1 € I. But f ¢ J and by our choice of m, f+ ¢ I. 


This contradiction proves that m= 1, so that f € J. Hence, I is a radical ideal. 


Finally, we can combine the Ideal- Variety Correspondence between radical ideals 


and varieties to get the following corollary to Proposition 4.22. 


Corollary 4.24. [CLO15] When k is algebraically closed, the functions I and V in- 
duce a one-to-one correspondence between irreducible varieties in k” and prime ideals in 


Keay os = oe 
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Chapter 5 


Proofs of Geometric Theorems 


using Algebraic Techniques 


Our objective in this thesis is to demonstrate the proving of geometric theorems 
using algebraic methods. We will introduce Cartesian coordinates in the Euclidean plane. 
We can then write polynomial equations relating the coordinates of a collection of points 
specified in the hypotheses and conclusions of these theorems. When working with large 
systems of equations, having algebraic structure to represent geometric theorems enables 
the use of technology to efficiently verify conclusions of theorems. To use technology, 
however, we must have appropriate inputs and we must know how to interpret the results. 
A computer program can quickly perform algorithms but cannot do the planning. If we 


don’t know what result we want, we won’t know if the theorem is verified. 


5.1 From Geometric Theorems to Polynomial Equations 


The properties of geometric figures such lines, angles, polygons and circles are 
unchanged under translation and rotation in the Euclidean plane so when we introduce 
Cartesian coordinates, we may place the object of our theorem in any convenient location. 
The coordinates of some of the points will be arbitrary and others will be dependent on 
the arbitrary ones. It is common practice to use variables u; to represent arbitrary values 
and x; to represent coordinates determined by the arbitrary ones. 

The following proposition lists the geometric statements we will use in the ex- 


amples presented in this chapter. 


Proposition 5.1. Let A, B, C, D be distinct points in the plane. Then each statement 
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can be expressed by one or more polynomial equations. 


(i) A, B, C are collinear. 

(ii) AB is perpendicular to CD. 
(iii) The distance from A to B is equal to the distance from C to D: AB =CD 
(iv) C is the midpoint of AB. 


Proof. Let A = (aj, a2), B= (b1,b2), C = (c1,c2) and D = (dj, d2) be distinct points in 
the plane. 


(i) The points A, B and C are collinear if the slopes of AB and BC are equal. Using 


the slope formula, we have 


ag—b2 _ be—ce2 
ay—by bi—c1 


From this, we can obtain the polynomial equation 
pi = (a2 — b2)(b1 — c1) — (a1 — b1)(b2 — c2) = 0. 


(ii 


Na 


To show AB | CD, we express AB as the vector (b; — a1, b2 — a2) and CD as the 
vector (dj — ¢,d2 — c2). Then AB | CD means the vectors are orthogonal. In 


other words, the dot product is zero. So we have 
(b1 — ay, b2 — ag) - (dy — C1, dg — c2) = 0 
which gives the polynomial equation 
p2 = (b1 — ay) (dy — €1) + (bz — a2)(dz — cz) = 0. 


To show AB = CD, we use the distance formula to show AB? = CD?. 


NS 


(iii 


AB? — (by _— ay)? + (bg = a2)? 
CD? = (dy _— c)? + (do _— C2)? 


Now AB? = CD? yields the polynomial equation 


p3 = (b1 — a1)? + (b2 — a2)? — (di — 1)? — (dap — )? = 0 
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(iv) If C is the midpoint of AB then A, B and C are collinear and AC = BC so we have 


the polynomial equations from statements (i) and (iii). 


Other common geometric statements such as two lines are parallel, a point is on 
a circle, a line is tangent to a circle and a line bisects an angle can also be expressed as 


polynomials equations. 


5.2 Example 1: The Centroid of a Triangle 


In this example we will demonstrate how the Groebner Basis Algorithm can be 
used to prove a geometric theorem that is taken as given in many secondary geometry 
courses. It is expressed as a definition - that the centroid of a triangle is the point of 


concurrency of the three medians of the triangle. 


Theorem 5.2. Let AABC be a triangle in the plane. If we let M, be the midpoint of BC, 
My be the midpoint of AC and Mz be the midpoint of AB, then the segments AM,, BM» 
and C'M3 meet at a single point M, called the centroid of the triangle. 


First we will express the hypotheses and conclusion as polynomial equations. 
This system of equations is not unique and not all systems yield the desired results. The 
equations listed below are the culmination of multiple attempts and some commentary 
regarding why these equations are favorable is included. 


The theorem is illustrated in the following figure. 


A M3 B 
Figure 5.1: AABC with Medians 


For convenience, we place A at the origin and align the line AB along the hori- 
zontal coordinate axis so A = (0,0) and B = (u1,0). Let C = (ue, ug). The coordinates 


of the midpoints of each side of the triangle are determined by the coordinates of the 
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endpoints. We have My = (21,22), Mz = (#3,24), M3 = (a5,26). The coordinates of 
the centroid are also completely determined so we write M = (27,xg). It is tempting to 
express the midpoints in terms of u; as one might in a typical coordinate proof. This 
would not produce a complete system of polynomials needed to generate the ideal. One 
practical guideline in constructing the system of polynomial equations is to write a poly- 
nomial equation for each dependent coordinate. Now we can obtain polynomial equations 


from the hypotheses: Using the midpoint formula, 


M, is the midpoint of BC: x = ae and 2 = “8 


produces 
hy = 2471 — uj, —u2 = 0 
ho = 2x2 — ug = 0. 
Similarly, 
Mg is the midpoint of AC: x3 = S and r4 = 4 and 
Mz is the midpoint of AB: x5 = S and a6 = 0 
produces 


hg = 273 — u2 = 0 
hag = 224 — uz = 0 
hs = 225 — uy, = 0 


he = x = 0. 


We now describe the position of M as being the point where two of the medians intersect. 


Our conclusion will be that the third median also contains M. Using Proposition 5.1, we 


can write equations to show M is on medians AM, and BM) as follows: 


A, M, M, collinear: “8 = 2 


U7 Ty 


B,M, Mp collinear: 8— = —4 


“L7—-U1 v3-U1" 


Clearing the denominators we obtain two more hypothesis polynomials 


hz = 213 — Lol7 = 0 


hg = x4(47 — u1) — &g (43 — 1) = 0. 
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It is typical that when a hypothesis is properly translated to polynomial equations, the 
number of dependent variables and the number of equations is the same. We now translate 


our conclusion to a polynomial equation: 


: . £g-u3 _ r6—U3 
C, M, Mz collinear: (a 


produces 
g = (5 — u2)(ag — uz) — (a6 — u3)(@7 — uz) = 0. 


In this example, there is only one conclusion. If the conclusion(s) require more than 
one polynomial, we would make sure each conclusion polynomial g; follows from the set 
of hypotheses. Our next step is to show that the conclusion g = 0 holds when the 
hypotheses h; = 0 hold. The polynomial equations in the eleven variables that repre- 
sent the hypotheses h;(u1, v2, ug, 11, 12, 13,4, U5,L6,L7,XLg) are equations that define a 
variety V = V(hy, ho, h3, ha, hs,he,h7,hg) C R''. We want to show that g vanishes 
whenever hi,h2,...,hg vanish. Let I = (hi, he, h3,ha,hs,he,h7,hs). According the 
the Groebner Basis technique, we can use the radical membership test to determine 
if g € \/(hi, ho, hg, ha, hs, he, h7,hg). The conclusion will follow from the hypotheses if 
lel= (hi, ha, h3,ha,hs,he,h7,hg,1— yg). Unfortunately, computing a Groebner basis 


using Sage did not render a basis of {1}. To understand why our technique was not 


successful, we consider a definition. 


Definition 5.3. [CLO15] Let V = V(hj,...,hn). The conclusion g follows strictly from 
the hypotheses hi,...,hn if g € I(V) C Rlw,...,Um,21,..-, Xn]. 


Many geometric theorems have degenerate cases that Definition 5.3 does not 
take into account. For instance, in Theorem 5.2, the centroid example, we cannot allow 
vertex C to be collinear with A and B. Another drawback of this definition is that 
because we are working over R, we do not have an effective method for determining I(V). 
We will proceed with a useful way to check to see if a conclusion follows strictly from a 


set of hypotheses before discussing a criterion that is not as strong. 
Proposition 5.4. [CLO15] Jf g € \/(hi,...,hn), then g follows strictly from hy,..., hn. 


Proof. We are given g € \/(hi,...,hn). This means g® € (h1,...,hn) for some s. Thus 
g° is some linear combination )> h;A;, where polynomials A; are in the ring 


Riui,..-;Um;,1,---,2n]. This means g* and, consequently, g vanish whenever hj,..., An 


vanish. 
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When a conclusion does not follow strictly from a set of hypotheses, possibly due 
to degenerate cases, we will need to decompose the variety into a finite union of irreducible 
varieties, as described in Chapter 4, to determine where the degenerate cases conflict with 
the coordinates that were designed to be arbitrary. That is, we will express the variety 
V=V,UVWU---UV,. This task can be simplified by recomputing the Groebner basis at 
each stage but it still has the potential to be rather lengthy. Furthermore, throughout the 
process, one must examine each polynomial to determine if its appearance in a variety 
conflicts with the construction of the hypotheses and the assumption that we do not have 
a degenerate case. We need to update our strategy so that our method determines the 
validity of a theorem taking into account degenerate cases that may need to be excluded. 

When a polynomial equation involving one of the arbitrary coordinates u; alone 
is a member of an irreducible component of the variety, this means that the function 
h; = u; = 0. In other words, one of the arbitrary coordinates is determined to be zero. 
In this case, we will exclude those components from the union since our intent was for 
those coordinates to be independent. Often this situation reveals a degenerate case. To 


help remedy this concern, we introduce another definition. 


Definition 5.5. [CLO15] Let W be an irreducible variety in the affine space R"™™ with 
coordinates u1,...,Um,21,...,2n. We say that the functions uj1,..., Um are algebraically 


independent on W if no nonzero polynomial in the u; alone vanishes identically on W. 


Another way to state the definition is u,,..., Um are algebraically independent 
on W if I(W)AR[wi,...,Um] = {0}. This allows us to strategically regroup the irreducible 
components of the variety so that the degenerate cases are easily identified. We express 


the variety as follows: 
V=W,U-:-UW,UU,U:--UU,, 


where U1,...,Um are algebraically independent on the components W; and are not alge- 
braically independent on the components of U;. Thus, U; represent degenerate cases of 
the hypotheses of the theorem. So, to guarantee the coordinates u; are indeed arbitrary 


in the geometric configuration being studied, we will only consider the subvariety 
V'=W,U---UW, CY. 


Since we do not want to consider how a conclusion g behaves in the degenerate cases, we 
adjust our definition of what it means for a conclusion to follow a set of hypotheses so 


that it is not as strict. 
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Definition 5.6. [CLO15] The conclusion g follows generically from the hypotheses 
hy,..-,hn if g € 1(V’) C Ri[w,...,Um,21,---,2n], where V’ C R™*” is the union of 
the components of the variety V = V(hi,...,hn) on which the u; are algebraically inde- 
pendent. 


It seems that we have simplified our task to develop a criterion that determines 
whether g vanishes V’. However, we will still need to decompose the original variety V 
which was the task we hoped to avoid. Moreover, if we successfully decomposed V into a 
union of irreducible varieties and identified the desired subvariety V’, we would still face 
the challenge of computing I(V). 

The next proposition suggests that we can determine if a conclusion follows 
generically from a set of hypotheses without knowing the nature of the decomposition 


the variety. 


Proposition 5.7. [CLO15] A conclusion g follows generically from hy,...,hn whenever 


there is some nonzero polynomial c(u1,...,Um) € R[u1,...,Um] such that 
c:-g€e€VH, 
where H is the ideal generated by the hypotheses h; in Riu.,...,Um,1,---,;2n\]- 


Proof. Let V; be one of the irreducible components of V’. We are taking as given that 
c:-g € VH. This means that c-g vanishes on V and thus, c- g vanishes on V;. It follows 
that c-g is in the ideal I(V;). Since V; is irreducible, the ideal I(V;) is a prime ideal so 
that c-g € I(V;) implies either c € I(V;) or g € I1(V;). No nonzero polynomial in u; alone 
vanishes in V; so we know c ¢ I(V;). We conclude that g € I(V;), satisfying the definition 


of what it means for g to follow generically from h1,..., hy. 


Now we have reduced our task to determining if there is a nonzero polynomial 
c with c-g € VH. By the definition of the radical, we have c- g € VA if and only if 


for some A; € R[u,...,Um,21,-.-,%] and s > 1. Since c is nonzero, we can divide both 


sides of the equation by c®. This yields 
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which demonstrates that g is in the ideal H generated by hy,...,hn over the ring 
R(ui,.--,Um)[%1,---,2%n]. In this ring, denominators depend only on the u;. Making 
ui invertible by augmenting our field of coefficients to R(u1,..., Um) in effect removes the 
degenerate cases so that the conclusion follows generically from the hypotheses. On the 
other hand, if g € Vi, then 


n 
g° = > Bihy, 
j=l 


where the coefficients of the B; are in our augmented field R(u1,..., Um). If we clear the 
denominators by multiplying both sides by c*® where c is a least common denominator for 


all the terms in all the B;, we obtain 
n 
/ 
(c- 9) = D0 Bini, 
j=l 


where Bi € Riu,...,Um,21,---,2%n] and c depends only on the u;. Consequently, c-g € 
VH. We now have multiple ways to determine the validity of the conclusion of a theorem. 


The following corollary to Proposition 5.7 lists them. 
Corollary 5.8. [CLO15] The following are equivalent: 
(i) There is a nonzero polynomial c € Riuy,..., Um] such that c-g € VH. 
(ii) gE VE where Hf is the ideal generated by the hj € R(u1,...,Um)[®1,---, En]. 


(iii) {1} is the reduced Groebner basis of the ideal 
(igees fay — 9g) CS Rita. 35 ti) Bisa 55 Das Dl: 


We return now to Theorem 5.2, our example that states the three medians 
of a triangle are concurrent. We will apply part (iii) of Corollary 5.8 to show that 


g = (X5 — U2)(rg — ug) — (46 — U3)("7 — U2) = O follows generically from 


hy = 241 — uz, — u2 = 0 
hg = 2279 — uz = 0 
hg = 273 — u2 = 0 
h4 = 2274 — uz = 0 


hs = 2a5 — uy = 0 
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he = t% = 0 


hz = Lg Lely S 0 


hg = x4(£7 U1) Xg(x3 ut) = 0. 


We use lex order and work in the ring R(w1, ua, ug) (11, £2, ©3, £4, 25, 1G, 7, Tg, yy]. Using 
the computer program Sage to compute the Groebner basis for the ideal (hi, ... hg, 1—yg) 
renders the desired result {1}. Therefore, g € (hi,...,hg). This validates the conclusion 
that the polynomial g represented, the point M defined as the intersection of medians 
AM, and BM, is also on the median C M3. 


Once we refined our strategy to compensate for the degenerate cases, the tech- 


nology did all the tedious division work for us. Not all theorems are suitable for the 
proving using the techniques described above. We typically will need to be able to in- 
troduce a coordinate system. There will be some number of arbitrary coordinates, or 
independent variables in our construction and a collection of dependent variables. Listed 
below is a summary of the steps involved in the Groebner Basis Algorithm for proving 


suitable geometric theorems: 


e Sketch the geometric object(s) involved in the theorem. Label all vertices, intersec- 
tions, and other relevant points. Distinguish arbitrary coordinates using indepen- 


dent variables u; from dependent variables 7;. 


e Determine the polynomial equations that represent the hypotheses h; and conclu- 


sions g;. Expect one hypothesis equation for each dependent variable. 


e With the assistance of a computer, determine if each conclusion follows strictly 
from the set of hypotheses by computing the reduced Groebner basis of the ideal 
(hi,..-;hn, 1 — yg) C Riwi,...,Um,21,---,2n,y]. If the reduced Groebner basis is 
{1} then it follows from the Radical Membership Test that g € \/(hi1,..., An). 


e If the reduced Groebner basis rendered is something other than {1}, make the 
independent variables part of the field of coefficients and compute the reduced 


Groebner basis on the ideal (hi,...,hn,1— yg) C R(u1,..-,Um)[%1,---,2n, y)- 


5.3. Example 2: The Orthocenter of a Triangle 


We have described how to express a geometric theorem as a system of polynomial 


equations so that a computer may be utilized to validate the theorem. In this example, 
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and the next one, the only part of the proof that requires human reasoning is deriving 
the system of polynomial equations. 

In secondary geometry courses, the orthocenter of a triangle is defined to be the 
point at which the three altitudes of a triangle intersect. Consideration is given to the 
seemingly contradictory notion that the center of a triangle might actually be outside of 
the triangle or, in the case of a right triangle, be one of the vertices, but the idea that 
such a point of concurrency exists is taken as given. Here we will prove that all three 


altitudes of a triangle are concurrent. 


Theorem 5.9. Let AABC be a triangle in the plane. Then the lines containing the 


altitudes of the triangle meet at a single point, called the orthocenter of the triangle. 


The altitude from vertex A is the line segment from A meeting the line containing 
the opposite side BC at a right angle. As shown in the figure below, we will call this 
meeting point D and notice that side BC was extended so the intersection point can 
be identified. The altitude from vertex B is the line segment from B meeting the line 
containing the opposite side AC at a right angle. In the figure below, this intersection 
point is labeled E. The altitude from vertex C' is the line segment from C' meeting 
the opposite side AB at a right angle. As shown below, side AB is extended and the 


intersection point is labeled F’. 


Figure 5.2: AABC with Altitudes 


We will again place A at the origin and align AB along the horizontal coordinate 
axis so A = (0,0) and B = (u1,0). Let C = (ug, ug). The coordinates of the foot of the 
altitude from each vertex to the line containing the opposite side of each side of the 


triangle are completely determined by the vertices. We have D = (21,22), E = (x3, 24), 
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F = (5,26). The coordinates of the orthocenter are also completely determined so we 
write H = (x7,28). Now using Proposition 5.1, we can obtain polynomial equations from 


the hypotheses: 


B; Dd; C collinear: —2~— = —3 


v1—-U1 U2—U1 


produces 


hy = £2(ug — U1) — u3(a1 — uy) = 0. 
Expressing the line segments as vectors, we have 
AD 1 BC: AD- BC = (21,22) - (ug — ui, U3) = 0 
which produces 
hg = xQu3 + ©1(u2 — 1) = 0. 
Likewise, 


A, E,C collinear : hg = xau2 — x3u3 = 0 

BE 1 AC: ha = vauz + ug(23 — u1) = 0 
A, B,F collinear : hs = xg = 0 

AB L CF : hg = u2— 25 =0 


Now we identify H as the point where two of the altitudes, AD and BE intersect giving 


two more hypothesis polynomials: 


A, D,H collinear : h7 = r9%7 — 41 xg = 0 


B,E,H collinear : hg = x4(x7 — u1) — 4g (43 — uy) = 0 
The conclusion, H is also on the altitude CF is translated 
C,F,H collinear: g = (26 — u3)(a7 — u2) — (a5 — u2)(xg — uz) = 0 
The reduced Groebner basis for the ideal 
(hi, ha, h3, ha, hs, he, h7,hs,1— yg) C R(us, ua, uz)(271,..., 28, y] 


rendered by Sage using lex ordering is {1} as desired so we conclude g vanishes on 
V (hy, ha, he, ha, hs, he, h7,hg) which validates the conclusion that the point H is on alti- 
tude CF. 
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5.4 Example 3: Euler’s Line 


We showed in Example 1 and Example 2 that the medians of a triangle are 
concurrent at a point we call the centroid of the triangle and the altitudes of a triangle 
all meet at a point we call the orthocenter of the triangle. There is another “center” of a 
triangle. We know that three noncollinear points always lie on a circle. These noncollinear 
points form a non-degenerate triangle. The center of the circle that circumscribes the 
triangle is called the circumcenter of the triangle. It is a famous theorem of Leonard Euler 
(1707-1783), a prolific contributor to many branches of mathematics, that the centroid, 
orthocenter and circumcenter of a triangle are collinear. The line containing the three 
“centers” of a triangle is called the Euler line of the triangle. We state the theorem below 
and demonstrate how encoding the hypotheses and conclusion as polynomial equations 


allows us to prove this geometric theorem using the Groebner Basis Algorithm. 


Theorem 5.10. Let AABC be a triangle in the plane. Then the circumcenter, centroid 


and orthocenter of the triangle are collinear. 


We begin with a sketch that uses the same points as in Examples 1 and 2. 


< ra 
ZY 
SEK 


E 


F 


Figure 5.3: AABC with Euler’s Line 


As in the previous examples, we place A at the origin and align AB along the 


horizontal axis. So we have A = (0,0), B = (u1,0), C = (u2,ug). Notice that the 
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coordinates of A, B, and C' completely determine the coordinates of the other points. 
The coordinates of the midpoints are as in Example 1, My; = (21,22), Mz = (3,24), 
M3 = (x5,x6). The coordinates of the centroid are M = (27,28). Now in Example 2, 
we re-used coordinates 11, %2,...,2g So we will need to reassign variables the the points 
D, E, F, and H. Let D = (29,210), E = (#11, 412), F = (#13, 214). The coordinates of 
the orthocenter are H = (215,216). The only new point in our figure is the center O of the 
circle that circumscribes AABC. Its coordinates are also dependent on the locations of 
the vertices so we assign it coordinates O = (217, 71g). We already have the 16 hypothesis 
polynomial equations from Examples 1 and 2. We will use the distance formula to show 
OA = OB and OA = OC making O is the circumcenter of AABC as follows: 


OA? = OB? : xi + aig = ("17 — 1)? + ig 


OA? = OC? : 227 +. 23, = (217 — ue)? + (21g — uz)”. 


The polynomial equations produced are 
hi7 = ur = 220171 = 0 
hig = us + u; — 2x17uU2 — 2x;gu3 = 0. 


The conclusion that M, H, and O are collinear is converted to a polynomial equation by 


showing the slopes between pairs of points are equal as follows: 


M,H,O collinear: =8=*16 — 2187 7s | 
are L17—-L15 ©17—-L7 


The polynomial equation for the conclusion is 


g = (218 — 216) (217 — 27) — ig = @8)( 217 — Zis) = 0: 
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A complete list of the polynomial equations for the hypotheses is 


hy = 2x41 — uy — ua = 0 
hg = 2x2 — ug = 0 

hg = 243 — ug = 0 

ha = 224 — uz = 0 

hs = 245 — u, = 0 

he = xg = 0 


hz =S@1ts — Loly7 = 0 


= 
oo 
| 


= £4(%7 — U1) — £g(x3 — U1) = 0 


hg = ©10(u2 — 1) — u3(ag — U1) = 0 
hig = T1903 + X9(ug — u1) = 0 
hy = Lig — 211 U3 = 0 
hig = £12U3 + U2(@11 — U1) = 0 
hi3 = 214 =0 


hia = ug — 213 = 0 


= 

a 

on 
I 


L10L15 — L916 = O 
hig = £12(@15 — 1) — L16(%11 — ui) = 0 
hi7 = ut = 22017U1 =0 


2 2 
hig = Ug + UZ — 2X17UQ — 2X1gU3 = 0. 


Using a computer to determine if g follows strictly from hy,...h 1g, we compute a reduced 
Groebner basis for the ideal (hi,...,h1s,1 — yg) C Riu, ua, u3,01,..., 218, y]. The com- 
puter does not render a Groebner basis {1} so we check to see if g follows generically by 
treating the independent variables as elements of the field of coefficients. We compute 
the reduced Groebner basis for (hi,..., hig, 1 — yg) C R(u1, ua, us, )[v1,--., 21g, y]. This 
time the computer renders the reduced Groebner basis {1} which tells us that g van- 
ishes on V(hi,...,h1g). This confirms our conclusion that the centroid, orthocenter and 
circumcenter of a triangle are collinear. In these examples, we had a specific conclusion 
we wished to validate. To demonstrate the Groebner Basis Algorithm, we chose exam- 
ples appropriate for the Cartesian plane but this algorithm is not restricted to proving 


theorems in Euclidean space. 
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Chapter 6 


Conclusion 


There are several strategies that can be used to prove geometric theorems. 
Among them are proofs learned in secondary geometry that employ postulates, defi- 
nitions and theorems arranged in paragraphs, two columns or flow charts to reason from 
the given hypothesis to the desired conclusion, proofs by geometric construction using 
a compass and straight edge and coordinate proofs. In this thesis, we explored the cor- 
respondence between algebra and geometry and demonstrated how the Groebner Basis 
Algorithm, an algebraic approach, can be used to prove geometric theorems. In partic- 
ular, we explored the correspondence between varieties and ideals, specifically, radical 
ideals. 

An important part of our study concerned the mappings from ideals to varieties 
and varieties to ideals. A crucial result was finding the one-to one correspondence be- 
tween varieties and radical ideals. Once we expressed our hypotheses and conclusions as 
polynomials, this bijection allowed us to translate a variety into an ideal and make use of 
technology to determine if the conclusion polynomial vanished on the variety comprised 
of the hypothesis polynomials by determining if the conclusion polynomial was a member 
of the corresponding radical ideal. Translating the variety to a radical ideal is what made 
the use of technology possible. 

Applying the Groebner Basis Algorithm made for efficient interpretation of the 
output from our computer application. The proof of Hilbert’s Nullstellensatz revealed a 
convenient way to revise the ideal so that the reduced Groebner basis for the revised ideal 
gave us a clear decision regarding the verification of the theorem. If the output was {1} 
then we concluded that the theorem was verified. We encountered a problem with the 


possibility of degenerate cases and resolved the issue by modifying the field of coefficients 
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to include the independent variables. 

Throughout this thesis, we laid the foundation for the use of technology to not 
only assist with our proofs of geometric theorems but to carry out all of the compu- 
tational tasks. The use of technology is a key component of algebraic geometry. The 
ability to correlate geometric concepts such as affine varieties to the algebra of polyno- 
mial rings allows us to take advantage of the algorithmic nature of computer applications 
to prove geometric theorems algebraically. For a mathematician, discovering the links 
between various branches of mathematics enhances understanding and motivates future 


exploration. 
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